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Description 

RECOMBINANT ALPHA VIRUS-BASED VECTORS WITH REDUCED 
INHIBITION OF CELLULAR MACROMOLECULAR SYNTHESIS 

5 

Statement of Government Interest 

This invention has been made in part with government support under 
grant number AI 17377, awarded by the National Institutes of Health. The government 
may have certain rights in the invention. 

10 

Technical Field of the Invention 

The present invention relates generally to recombinant DNA technology; 
and more specifically, to the development of recombinant vectors useful for directing 
the expression of one or more heterologous gene products. 

15 

Background of the Invention 

Alphaviruses comprise a set of serologically related arthropod-borne 
viruses of the Togaviridae family. These viruses are distributed worldwide, and persist 
in nature through a mosquito to vertebrate cycle. Birds, rodents, horses, primates, and 

20 humans are among the defined alphavirus vertebrate reservoir/hosts. 

Twenty-six known viruses and virus subtypes have been classified 
within the alphavirus genus utilizing the hemagglutination inhibition (HI) assay. This 
assay segregates the 26 alphaviruses into three major complexes: the Venezuelan 
equine encephalitis (VEE) complex, the Semliki Forest (SF) complex, and the western 

25 equine encephalitis (WEE) complex. In addition, four other viruses, eastern equine 
encephalitis (EEE), Barmah Forest, Middelburg, and Ndumu, receive individual 
classification based on the HI serological assay. 

Members of the alphavirus genus also are classified based on their 
relative clinical features in humans: alphaviruses associated primarily with 

30 encephalitis, and alphaviruses associated primarily with fever, rash, and polyarthritis. 
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Included in the former group are the VEE and WEE complexes, and EEE. In general, 
infection with this group can result in permanent sequelae, including behavior changes 
and learning disabilities, or death. In the latter group is the SF complex, comprised of 
the individual alphaviruses Semliki Forest, Sindbis, Ross River, Chikungunya, 
5 O'nyong-nyong, and Mayaro. With respect to this group, although serious epidemics # 
have been reported, infection is in general self-limiting, without permanent sequelae. 

Sindbis vims is the prototype member of the Alphavirus genus of the 
Togaviridae family. Its replication strategy after infection of cells {see Figure 1) has 
been well characterized in chicken embryo fibroblasts (CEF) and baby hamster kidney 

10 (BHK) cells, where Sindbis virus grows rapidly and to high titer, and serves as a model 
for other alphaviruses. Briefly, the genome from Sindbis virus (like other alphaviruses) 
is an approximately 12 kb single-stranded positive-sense RNA molecule which is 
capped and polyadenylated, and contained within a virus-encoded capsid protein shell. 
The nucleocapsid is further surrounded by a host-derived lipid envelope into which two 

15 viral-specific glycoproteins, El and E2, are inserted and anchored to the nucleocapsid. 
Certain alphaviruses (e.g., SF) also maintain an additional protein, E3, which is & 
cleavage product of the E2 precursor protein, PE2. After virus particle absorption to 
target cells, penetration, and uncoating of the nucleocapsid to release viral genomic 
RNA into the cytoplasm, the replicative process is initiated by translation of the 

20 nonstructural proteins (nsPs) from the 5* two-thirds of the viral genome. The four nsPs 
(nsPl-nsP4) are translated directly from the genomic RNA template as one of two 
polyproteins (nsP123 or nsP1234), and processed post-translationally into monomeric 
units by an active protease in the C-terminal domain nsP2. A leaky opal (UGA) codon 
present between nsP3 and nsP4 of most alphaviruses accounts for a 10 to 20% 

25 abundance of the nsP1234 polyprotein, as compared to the nsP123 polyprotein. Both of 
the nonstructural polyproteins and their derived monomeric units may participate in the 
RNA replicative process, which involves binding to the conserved nucleotide sequence 
elements (CSEs) present at the 5' and 3 1 ends, and a junction region subgenomic 
promoter located internally in the genome (discussed further below). 
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The positive strand genomic RNA serves as template for the nsP- 
catalyzed synthesis of a full-length complementary negative strand. Synthesis of the 
complementary negative strand is catalyzed after binding of the nsP complex to the 3 1 
terminal CSE of the positive strand genomic RNA. The negative strand, in turn, serves 
5 as template for the synthesis of additional positive strand genomic RNA and an 
abundantly expressed 26S subgenomic RNA, initiated internally at the junction region 
promoter. Synthesis of additional positive strand genomic RNA occurs after binding of 
the nsP complex to the 3' terminal CSE of the complementary negative strand genomic 
RNA template. Synthesis of the subgenomic mRNA from the negative strand genomic 

10 RNA template, is initiated from the junction region promoter. Thus, the 5* end and 
junction region CSEs of the positive strand genomic RNA are functional only after they 
are transcribed into the negative strand genomic RNA complement (i.e., the 5' end CSE 
is functional when it is the 3' end of the genomic negative stranded complement). The 
structural proteins (sPs) are translated from the subgenomic 26S RNA, which represents 

15 the 3' one-third of the genome, and like the nsPs, are processed post-translationally into 
the individual proteins. 

Several groups have suggested utilizing certain members of the 
alphavirus genus as an expression vector, including, for example, Sindbis virus (Xiong 
et al.. Science 243:1 188-1 191, 1989; Hahn et al., Proc. Natl. Acad Set USA 89:2679- 

20 2683, 1992; Duben y et al., J. Virol 70:508-519, 1996), Semliki Forest virus 
(Liljestrom, Bio/Technology 9: 1356- 1361, 1991), and Venezuelan Equine Encephalitis 
virus (Davis et ah, 1 Cell Biochem. Suppi 79/1:10, 1995). In addition, one group has 
suggested using alphavirus-derived vectors for the delivery of therapeutic genes in vivo. 
One difficulty, however, with the above-referenced vectors is that inhibition of host 

25 cell-directed macromolecular synthesis (/.*., protein or RNA synthesis) begins within a 
few hours after infection and cytopathic effects (CPE) occur within 12 to 16 hours post 
infection (hpi). Inhibition and shutoff of host cell protein synthesis begins within 2 hpi 
in BHK cells infected with recombinant viral particles, in the presence or absence of 
structural protein expression, suggesting that the early events after vims infection (e.g., 
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synthesis of nsPs and minus strand RNA) may directly influence the inhibition of host 
cell protein synthesis and subsequent development of CPE and cell death. 

SIN-1 is a variant strain derived from wild-type Sindbis* and was 
isolated from a culture of BHK cells persistently infected with Sindbis virus over a 
5 period of one month (Weiss et al. J. Virol 33: 463-474, 1980). A pure SIN-1 virus 
stock obtained by expansion from a single plaque does not kill the BHK cells which it 
infects. Importantly, virus yields (>10 3 PFU/cell) are the same in BHK cells infected 
with wild-type Sindbis virus or the variant SIN-1 virus. Thus, the principle phenotype 
of SIN-1 in infected BHK cells is characterized by production of wild-type levels of 
10 infectious virus in the absence of virus-induced cell death. 

The present invention provides recombinant vectors with selected 
desirable phenotypes for use in a variety of applications, including for example, gene 
therapy and recombinant protein production, and further provides other related 
advantages. 

15 

Summary of the Invention 

Briefly stated, the present invention provides RNA vector replicons, 
alphavirus vector constructs, eukaryotic layered vector initiation systems and 
recombinant alphavirus particles which exhibit reduced, delayed, or no inhibition of 

20 cellular macromolecular synthesis (e.g., protein or RNA synthesis), thereby permitting 
the use of these vectors for protein expression, gene therapy and the like, with reduced, 
delayed, or no development of CPE or cell death. Such vectors may be constructed 
from a wide variety of alphaviruses {e.g., Semliki Forest virus, Ross River virus, 
* Venezuelan equine encephalitis virus or Sindbis virus), and designed to express 

25 numerous heterologous sequences (e.g., a sequence corresponding to protein, a 
sequence corresponding to antisense RNA, a sequence corresponding to non-coding 
sense RNA, or a sequence corresponding to ribozyme). 

Within one aspect of the invention, isolated nucleic acid molecules are 
provided comprising an altered alphavirus nonstructural protein gene which, when 

30 operably incorporated into a recombinant alphavirus, increases the time required to 
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reach 50% inhibition of host-cell directed macromolecuiar synthesis following 
expression in mammalian cells, as compared to a wild-type alphavirus. As utilized 
within the context of the present invention, "altered alphavirus nonstructural protein 
gene" refers to a gene which, when operably incorporated into an alphavirus RNA 
5 vector replicon, recombinant alphavirus particle, or eukaryotic layered vector initiation 
system, produces the desired phenotype (e.g., reduced, delayed or no inhibition of 
cellular macromolecuiar synthesis). Such altered alphavirus nonstructural protein genes 
will have one or more nucleotide substitutions, deletions, or insertions, which alter the 
nucleotide sequence from that of the wild-type alphavirus gene. The gene may be 
0 derived either artificially (e.g., from directed selection procedures; see Example 2 
below), or from naturally occurring viral variants (see Example 1 below). In addition, it 
should be understood that when the isolated nucleic acid molecules of the present 
invention are incorporated into an alphavirus RNA vector replicon, recombinant 
alphavirus particle, or eukaryotic layered vector initiation system as discussed above, 
5 that they may, within certain embodiments, substantially increase the time required to 
reach 50% inhibition of host-cell directed macromolecuiar synthesis, up to aid 
including substantially no detectable inhibition of host-cell directed macromolecuiar 
synthesis. Assays suitable for detecting percent inhibition of host-cell directed 
macromolecuiar synthesis include, for example, that described within Example 1 . 

Within other aspects of the invention, isolated nucleic acid molecules are 
provided comprising an altered alphavirus nonstructural protein gene which, when 
operably incorporated into a recombinant alphavirus particle, eukaryotic layered vector 
initiation system, or RNA vector replicon, results in a reduced level (e.g., 2-fold, 5-fold, 
10-fold, 50-fold or more than 100-fold) of vector-specific RNA synthesis as compared 
to the wild-type, and the same or greater level of protein encoded by RNA transcribed 
from the viral junction region promoter, as compared to a wild-type recombinant 
alphavirus particle, wild-type eukaryotic layered vector initiation system, or wild-type 
RNA vector replicon. Representative assays for quantitating RNA levels include [ 3 H] 
uridine incorporation as described in Example 1 , or RNA accumulation as detected by 
Northern Blot analysis (see Example 4). Representative assays for quantitating protein 
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levels include scanning densitometry (see Example 4) and various enzymatic assays 
(see Examples 3-5). 

Within one embodiment of the above, the isolated nucleic acid molecule 
encodes nonstructural protein 2 (nsP2). Within a further embodiment, the isolated 
5 nucleic acid molecule has a mutation in the LXPGG motiffof nsP2. 

Within another aspect of the invention, expression vectors are provided 
comprising a promoter operably linked to one of the above-described nucleic acid 
molecules. Within one embodiment, the expression vector farther comprises a 
polyadenylation sequence or transcription termination sequence 3* to the nucleic acid 
10 molecule. 

Within yet another aspect of the present invention, alphavirus vector 
constructs are provided, comprising a 5* promoter which initiates synthesis of viral 
RNA in vitro from cDNA, a 5' sequence which initiates transcription of alphavirus 
RNA, a nucleic acid molecule which operably encodes all four alphaviral nonstructural 

15 proteins including an isolated nucleic acid molecule as described above, an alphavirus 
RNA polymerase recognition sequence and a 3* polyadenylate tract. Within orte 
embodiment, such constructs further comprise a selected heterologous sequence 
downstream of and operably linked to a viral junction region. 

Within yet other aspects of the present invention, RNA vector replicons 

20 capable of translation in a eukaryotic system are provided, comprising a 5' sequence 
which initiates transcription of alphavirus RNA, a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins, including the isolated nucleic acid 
molecules discussed above, an alphavirus RNA polymerase recognition sequence and a 
~ 3 f polyadenylate tract. Within another embodiment, such RNA vector replicons further 

25 comprise a selected heterologous sequence downstream of and operably linked to a viral 
junction region. Within further aspects of the invention, host cells are provided which 
contain one of the RNA vector replicons described herein. Within additional aspects of 
the invention, pharmaceutical compositions are provided comprising RNA vector 
replicons as described above and a pharmaceutically acceptable carrier or diluent. 
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Within other aspects of the invention, recombinant alphavirus particles 
are provided, comprising one or more alphavirus structural proteins, a lipid envelope, 
and an RNA vector replicon as described herein. Within one embodiment, one or more 
of the alphavirus structural proteins are derived from a different alphavirus than the 
5 alphavirus from which the RNA vector replicon was derived. Within other 
embodiments, the alphavirus structural protein and lipid envelopes are derived from 
different species. Within further aspects, pharmaceutical compositions are provided 
comprising a recombinant alphavirus particle as disclosed above and a pharmaceutically 
acceptable carrier or diluent. Further, mammalian cells infected with such recombinant 
1 0 alphavirus particles are also provided. 

Within certain embodiments of the invention, the above described 
vectors or particles may further comprise a resistance marker which has been fused, in- 
frame, with the heterologous sequence. Representative examples of such resistance 
markers include hygromycin phosphotransferase and neomycin. 
1 5 Within other aspects of the present invention, methods are provided for 

selecting alphavirus or recombinant alphavirus vector variants which exhibit -the 
phenotype described herein of reduced, delayed, or, no inhibition of host cell directed 
macromolecular synthesis. Representative examples of such methods include the use of 
selectable drug or antigenic markers and are provided in more detail below in Example 
20 2. 

Within other aspects of the present invention. Togavirus capsid particles 
are provided which contain substantially no genomic (i.e., wild-type virus genome) or 
RNA vector replicon nucleic acids. Representative examples of Togaviruses include, 
for example alphaviruses and rubiviruses (e.g., rubella). Within certain embodiments, 

25 the capsid particles further comprise a lipid envelope containing one or more alphavirus 
glycoproteins. Within other embodiments, the capsid particle further comprises an 
alphavirus envelope (i.e., the lipid bilayer and the glycoprotein complement). Within 
related aspects of the present invention, pharmaceutical compositions are provided 
comprising the above noted capsid particles (with or without a lipid bilayer (e.g.. viral 

30 envelope containing alphavirus glycoproteins)) along with a pharmaceutically 
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acceptable carrier or diluent. Within further aspects, such capsid particles (with or 
without a lipid bilayer (e.g., viral envelope containing alphavirus glycoproteins)) or 
pharmaceutical compositions may be utilized as a vaccinating agent in order to induce 
an immune response against a desired togavirus. 
5 Within further aspects of the invention, inducible promoters are provided 

comprising a core RNA polymerase promoter sequence, an operably linked nucleic acid 
sequence that directs the DNA binding of a protein that activates transcription from the 
core promoter sequence, and an operably linked nucleic acid sequence that directs the 
DNA binding of a protein that represses transcription from the core promoter sequence. 

10 Such promoters may be utilized in the gene delivery vehicles described herein, as well 
as a wide variety of other vectors known to those skilled in the art. 

Within other aspects, alphavirus structural protein expression cassettes 
are provided comprising a 5' promoter which initiates synthesis of viral RNA from 
DNA. a nucleic acid molecule which encodes one or more functional alphavirus 

1 5 structural proteins, a selectable marker operably linked to transcription of the expression 
cassette, and optionally, a 3' sequence which controls transcription termination. Withfli 
one embodiment, such expression cassettes further comprise a 5' sequence which 
initiates transcription of alphavirus RNA, a viral junction region promoter, and an 
alphavirus RNA polymerase recognition sequence. Within another embodiment, the 

20 expression cassette further comprises a catalytic ribozyme processing sequence, post- 
translational transcriptional regulatory elements which facilitate RNA export from the 
nucleus, and/or elements which permit translation of multicistronic mRNA, selected 
from the group consisting of Internal Ribosome Entry Site elements, elements 
* promoting ribosomal read through and BiP sequence. Within other embodiments, the 

25 selectable marker is operably linked to a 5* promoter capable of initiating synthesis of 
alphavirus RNA from cDNA. Within further embodiments, the selectable marker is 
positioned downstream from a junction region promoter and from the nucleic acid 
molecule which encodes alphavirus structural proteins. Within yet other embodiments, 
the 5' promoter is an inducible promoter as described herein. Within another 

30 embodiment, the alphavirus structural protein expression cassette further comprises an 
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alphavirus capsid protein gene or other sequence (e.g., a tobacco etch virus or "TEV" 
leader) which is capable of enhancing translation of one or more functional alphavirus 
structural protein genes located 3' to the enhancer sequence. Preferably, the capsid 
protein gene sequence is derived from a different alphavirus than that from which the 
5 sequence encoding the alphavirus structural genes is obtained. 

Within yet other aspects of the invention, alphavirus packaging cell lines 
are provided comprising a cell containing an alphavirus structural protein expression 
cassette as described above. In certain embodiments, the alphavirus packaging cell 
lines are stably transformed with the alphavirus structural protein expression cassettes 
10 provided herein. Within related aspects, alphavirus producer cell lines are provided 
comprising a cell which contains a stably transformed alphavirus structural protein 
expression cassette, and a vector selected from the group consisting of RNA vector 
replicons, alphavirus vector constructs and eukaryotic layered vector initiation systems. 

Within yet other aspects of the present invention, eukaryotic layered 
15 vector initiation systems are provided, comprising a 5' promoter capable of initiating 
in vivo the 5' synthesis of RNA from cDNA, a sequence which initiates transcription of 
alphavirus RNA following the 5* promoter; a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins, including an isolated nucleic acid 
molecule as discussed above, an alphavirus RNA polymerase recognition sequence, and 
20 a 3' polyadenylate tract. Representative examples of suitable 5' promoters include RNA 
polymerase I promoters, RNA polymerase II promoters, RNA polymerase III 
promoters, the HSV-TK promoter, RSV promoter, tetracycline inducible promoter, 
MoMLV promoter, a SV40 promoter and a CMV promoter. Within preferred 
embodiments, the 5' promoter is an inducible promoter as described herein. 
25 Within certain embodiments, eukaryotic layered vector initiation systems 

are provided which further comprise a heterologous sequence operably linked to a viral 
junction region, and/or a post-transcriptional regulatory element which facilitates RNA 
export from the nucleus. Within further embodiments, the eukaryotic layered vector 
inititation systems provided herein may further comprise a transcription termination 
30 signal. 
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Within related aspects, the present invention also provides host cells 
(e.g., vertebrate or insect) containing a stably transformed eukaryotic layered vector 
initiation system as described above. Within further aspects of the present invention, 
methods for delivering a selected heterologous sequence to a vertebrate or insect are 
5 provided, comprising the step of administering to a vertebrate or insect an alphavirus 
vector construct, RNA vector replicon, recombinant alphavirus particle, or a eukaryotic 
layered vector initiation system as described herein. Within certain embodiments, the 
alphavirus vector construct, RNA vector replicon, recombinant alphavirus particle or 
eukaryotic layered vector initiation system is administered to cells of the vertebrate 
10 ex vivo, followed by administration of the vector or particle-containing cells to a warm- 
blooded animal. 

Within other aspects, pharmaceutical compositions are provided 
comprising a eukaryotic layered vector initiation system as discussed above, and a 
pharmaceutical^ acceptable carrier or diluent. Within certain embodiments, the 

15 pharmaceutical composition is provided as a liposomal formulation. 

Within further aspects, methods of making recombinant alphavirtts 
particles are provided, comprising the steps of (a) introducing a vector such as a 
eukaryotic layered vector initiation system, RNA vector replicon, or alphavirus vector 
particle as described above into a population of packaging cells under conditions and 

20 for a time sufficient to permit production of recombinant alphavirus particles, and 
(b) harvesting recombinant alphavirus particles. Within related aspects, methods of 
making a selected protein are provided, comprising the steps of (a) introducing a vector 
which encodes a selected heterologous protein, such as a eukaryotic layered vector 
initiation system, RNA vector replicon or alphavirus vector particle described above, 

25 into a population of packaging cells, or other cells under conditions and for a time 
sufficient to permit production of the selected protein, and (b) harvesting protein 
produced by the vector containing cells. Within yet other aspects, methods of making a 
selected protein are provided, comprising the step of introducing a eukaryotic layered 
vector initiation system which is capable of producing a selected heterologous protein 

30 into a host cell, under conditions and for a time sufficient to permit expression of the 
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selected protein. Within further aspects, host cell lines are provided which contain a 
RNA vector replicon as described herein. 

Within yet other aspects of the present invention, alphavirus vaccines are 
provided, comprising one of the above-described alphavirus vector constructs, RNA 
5 vector replicons, eukaryotic vector initiation systems, or recombinant alphavirus 
particles, which may or may not express one of the heterologous sequences provided 
herein (e.g., they may be utilized solely as a vaccine for treating or preventing 
alphaviral diseases). For example, within one embodiment of the invention, 
recombinant togavirus particles are provided which have substantially no nucleic acid 
10 or RNA vector replicon nucleic acid. Within a further embodiment, recombinant 
togavirus particles are provided which contain hetereologous viral nucleic acids (i.e., 
from a different virus than the togavirus particle). Within yet another embodiment, the 
recombinant togavirus particle is T=3 or greater. 

Within further aspects of the invention, recombinant chimeric togavirus 
15 particles (either empty, or containing nucleic acids) are provided wherein the viral 
particle has viral structural components obtained or derived from different Togaviritiae 
(e.g., the capsid protein and glycoprotein is obtained from different alphavirus sources). 

These and other aspects and embodiments of the invention will become 
evident upon reference to the following detailed description and attached figures. In 
20 addition, various references are set forth herein that describe in more detail certain 
procedures or compositions (e.g., plasmids, sequences, etc.), and are therefore 
incorporated by reference in their entirety as if each were individually noted for 
incorporation. 

25 Brief Description of the Figures 

Figure 1 is a schematic illustration of Sindbis genome organization and 
replication strategy. 

Figure 2 is a graph of virus release from BHK cells infected at an MOI 
of 10 with SIN-1, SIN-l/nsPl-4, Totol 101, or Sin-l/nsP2 viruses. Culture supernates 
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were collected at 3, 6, 9 and 12 hours post-infection. Virus titers were determined by 
plaque assay. 

Figure 3 is a graph depicting viral RNA synthesis in BHK cells 
following infection by TotollOl, SIN-l/nsP2, SIN-l/nsPl-4, or SIN-1 virus. Cells 
5 were infected at an MOI 10 and at 1 hour post-infection, actinomycin D and 3 H-uridine 
were added. At 3, 6 ? 9, and 12 hpi the amount of 3 H-uridine incorporation was 
determined. 

Figure 4 is a graph depicting viral RNA synthesis in BHK cells infected 
by SIN-l/nsPl, SIN-l/nsP2, SIN-l/nsP3, SIN-l/nsP3-4, SIN-l/nsP4, SlN-l/nsP2-C 
10 SIN-l/nsP2-N, TotollOK SIN-1, or SIN-l/nsPl-4. The levels of 3 H-uridine 
incorporation are expressed relative to wild-type (Toto 1101) infection. 

Figure 5 is a graph depicting the shut-off of host cell protein synthesis in 
BHK cells infected by SIN-InsPl-4, SIN-1, SIN-lnsP2, or TotollOl viruses. 

Figure 6 is the cDNA sequence of 8000 bases of SIN-1 virus (SEQ. ID 

15 NO. 101). 

Figure 7 is the cDNA sequence of 8000 bases of SINCG virus (SEQ. ID 

NO. 102). 

Figure 8 is the cDNA sequence of Toto 1 101 virus (SEQ. ID NO. 103). 

Figure 9 A is a northern blot of RNAs isolated from BHK-21 cells that 
20 were transfected with pBG/SIN-1 ELVS 1.5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmid DNAs, and hybridized with a radiolabeled viral RNA probe. Figure 9B is a 
graph depicting a 7 day timecourse of alkaline phosphatase expression in BHK cells 
transfected with pBG/SIN-1 ELVS 1.5-SEAP or pBG/wt ELVS 1.5-SEAP plasmid 
DNAs. 

25 Figure 10 is a graph depicting a 4 day timecourse of luciferase 

expression in BHK cells transfected with pBG/SIN-1 ELVS 1.5-luc or pBG/wt ELVS 
1 .5-luc plasmid DNAs. 

Figure 11A is a northern blot of RNAs isolated from BHK-21 cells that 
were transfected with pBG/SIN-1 ELVS-1.5-p-gal or pBG/wt ELVS 1.5-p-gal plasmid 

30 DNAs, and hybridized with a radiolabeled viral RNA probe. Figure 1 IB is a western 
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blot detecting p-gal expression in BHK-21 cells transfected with either pBG/SIN-1 
ELVS-l.5-p.gal or pBG/wt ELVS 1.5-p-gal plasmid DNAs. Figure 11C is a graph 
depicting a 5 day timecourse of alkaline phosphatase expression in BHK cells 
transfected with pBG/SIN-1 ELVS-1.5-P-gal or pBG/wt ELVS 1.5-p-gal plasmid 
5 DNAs. 

Figure 12A & B are graphs depicting P-gal expression in HT1080 and 
BHK-21 cells transfected with ELVS p-gal vectors with or without HBV PRE 
sequences, as measured by RLU (relative light units). 

Figure 13 is a schematic illustration of RNA amplification, structural 
10 protein expression, and vector packaging by vector inducible alphavirus packaging cell 
lines. 

Figure 14 is a schematic illustration of vector inducible structural protein 
expression cassettes used in the generation of alphavirus packaging cell lines. 

Figure 15 is a graph depicting luciferase vector packaging (transfer of 
1 5 expression) by different alphavirus packaging cell lines. 

Figure 16 is a western blot depicting vector inducible structural protein 
expression by an alphavirus packaging cell line. 

Figure 17A is a graph depicting luciferase vector packaging by C6/36 
mosquito cells containing the pDCMV-intSINrbz structural protein expression cassette. 
20 Figure 17B is a graph depicting luciferase vector packaging by human 293 packaging 
cells stably transformed with plasmid pBGSVCMVdlneo. 

Figure 1 8 is a graph depicting luciferase vector packaging by different 
alphavirus packaging cell lines. 

Figure 19 is an RNA gel autoradiograph depicting 3 H uridine-labeled 
25 RNAs from BHK cells infected with SINrep/LacZ vector particles produced from an 
alphavirus packaging cell line. 

Figure 20 is a protein gel autoradiograph depicting 35 S methionine- 
Iabeled proteins from BHK cells infected with SINrep/LacZ vector particles produced 
from an alphavirus packaging cell line. 
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Figure 21 is a western blot depicting vector inducible capsid protein 
expression following p-gal or glycoprotein vector transfection of a capsid cell line. 

Figure 22 is a schematic illustration of the region of structural protein 
expression cassettes comprising a wild-type or deletion mutant Ross River virus capsid 
5 protein gene. 

Figure 23 is a schematic illustration of vector inducible structural protein 
expression cassettes containing a wild-type or deletion mutant Ross River virus capsid 
protein gene. 

Figure 24 is a schematic illustration of vector packaging by "split" 
10 structural protein gene expression cassettes which contain a Ross River virus capsid 
protein gene sequence upstream of the Sindbis virus glycoprotein genes. 

Figure 25 is a schematic illustration of vector packaging by "split" 
structural protein gene expression cassettes which contain a Ross River virus capsid 
protein gene sequence upstream of the Sindbis virus glycoprotein genes on one cassette, 
1 5 and the Sindbis virus capsid protein gene in a separate cassette. 

Figure 26 is a table showing the results of vector particle packaging 
using the above "split" structural protein gene expression cassettes. 

Figure 27 is a schematic illustration of the use of alphavirus packaging 
cell lines for the amplification of packaged vector particle preparations and the large 
20 scale production of recombinant protein. 

Figure 28 is a graph depicting the amplification and production of p-gal 
protein over time using alphavirus packaging cell lines. 

Figure 29 is a schematic illustration of the use of a tetracycline regulated 
- promoter system to control expression of alphavirus vector RNA from cDNA in vivo, 
25 Figure 30 is a schematic illustration of the use of a linked transcriptional 

repressor and a transcriptional inducer/activator regulated promoter system to control 
expression of alphavirus vector RNA from cDNA in vivo. 

Figure 31 A & B are autoradiographs of [ 3 H]uridine-labeIed RNAs 
electrophoresed on denaturing glyoxal gels that were isolated from BHK cell 
30 electroporated with SINrep/LacZ replicon and DH RNAs from various RRV capsid 
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containing DH constructs, and from the vector particles present in the culture fluids at 
18 hours post electroporation. 

Figure 32A & B are protein gel autoradiographs depicting 35 S 
methionine-labelled proteins from BHK cells electroporated with SINrep/LacZ replicon 
5 and DH RNAs from various RRV capsid containing DH constructs, and from the vector 
particles present in the culture fluids at 1 8 hours post electroporation. 

Figure 33A-D are Kyte-Doolittle hydrophobicity plots of various Ross 
River virus (RRV) capsid proteins, expressed from the wild-type gene (A) and three 
deletion mutants CAlrrv, CA2rrv, and CA3rrv (B-D, respectively). 
10 Figure 34 is a schematic that illustrates the amino-terminus RRV capsid 

proteins expressed from the wild-type gene (SEQ. ID NO. 114), and three deletion 
mutants CAlrrv (SEQ. ID NO. 1 15), CA2rrv (SEQ. ID NO. 116), and CA3rrv (SEQ. ID 
NO. 117). The lysine residues deleted in the RRV capsid gene mutants are indicated. 

Figure 35 is a graph that illustrates the relative levels of [ 35 S]methionine 
1 5 and [ 3 H]uridine incorporated into virus particles in BHK cells infected at high MOI with 
Toto 1101 wild-type virus. 

Figure 36 is a graph that illustrates the relative levels of [ 35 S] methionine 
and [ 3 H]uridine incorporated into virus particles in BHK cells electroporated with 
SINrep/lacZ and DH-BB (5* tRNA) Crrv DH RNAs. 
20 Figure 37 is a graph that illustrates the relative levels of [ 35 S]methionine 

and [ 3 H]uridine incorporated into virus particles in BHK cells electroporated with 
SINrep/lacZ and RRV capsid deletion mutant DH-BB (5* tRNA) CA3rrv DH RNAs. 

Figure 38 is a graph which compiles the results shown in Figures 35-37, 
depicting the relative levels of [ 35 S]methionine and [ 3 H]uridine incorporated into virus 
25 particles in BHK cells electroporated with SINrep/lacZ and DH RNAs, or infected with 
TotollOl wild-type virus. 

Figure 39 is a graph which illustrates luciferase vector packaging by 
BHK cells stably transformed with pBGSVCMVdlhyg. 
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Definition of Terms 

The following terms are used throughout the specification. Unless 
otherwise indicated, these terms are defined as follows: 

" Genomic RNA " refers to RNA which contains all of the genetic 
5 information required to direct its own amplification or self-replication in vivo, within a 
target cell. To direct its own replication, the RNA molecule may: 1) encode one or 
more polymerase, replicase, or other proteins which may interact with viral or host cell- 
derived proteins, nucleic acids or ribonucleoproteins to catalyze the RNA amplification 
process; and 2) contain cis RNA sequences required for replication, which may be 

10 bound during the process of replication by its self-encoded proteins, or non-self- 
encoded cell-derived proteins, nucleic acids or ribonucleoproteins, or complexes 
between any of these components. An alphavirus-derived genomic RNA molecule 
should contain the following ordered elements: 5' viral or defective-interfering RNA 
sequence(s) required in cis for replication, sequences which, when expressed, code for 

15 biologically active alphavirus nonstructural proteins (e.g., nsPl, nsP2, nsP3, nsP4), 3' 
viral sequences required in cis for replication, and a polyadenylate tract. The 
alphavirus-derived genomic RNA vector replicon also may contain a viral subgenomic 
"junction region" promoter which may, in certain embodiments, be modified in order to 
prevent, increase, or reduce viral transcription of the subgenomic fragment, and 

20 sequences which, when expressed, code for biologically active alphavirus structural 
proteins (e.g., C, E3, E2, 6K, El). Generally, the term genomic RNA refers to a 
molecule of positive polarity, or "message" sense, and the genomic RNA may be of 
length different from that of any known, naturally-occurring alphavirus. In preferred 
* embodiments, the genomic RNA does not contain the sequences which encode any 

25 alphaviral structural protein(s); rather those sequences are substituted with heterologous 
sequences. In those instances where the genomic RNA is to be packaged into a 
recombinant alphavirus particle, it must contain one or more sequences which serve to 
initiate interactions with alphavirus structural proteins that lead to particle formation, 
and preferably is of a length which is packaged efficiently by the packaging system 

30 being employed. 
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"Subgenomic RNA", or "26S" RNA, refers to a RNA molecule of a 
length or size which is smaller than the genomic RNA from which it was derived. The 
subgenomic RNA should be transcribed from an internal promoter whose sequences 
reside within the genomic RNA or its complement. Transcription of the subgenomic 
5 RNA may be mediated by viral-encoded polymerase^), host cell-encoded 
polymerase(s), transcription factor(s), ribonucleoprotein(s), or a combination thereof. 
In preferred embodiments, the subgenomic RNA is produced from a vector according to 
the invention, and encodes or expresses the gene(s) or sequence(s) of interest. The 
subgenomic RNA need not necessarily have a sedimentation coefficient of 26. 
10 " Alphavirus vector construct " refers to an assembly which is capable of 

directing the expression of a sequence(s) or gene(s) of interest. Such vector constructs 
are comprised of a 5' sequence which is capable of initiating transcription of an 
alphavirus RNA (also referred to as 5* CSE, in background), as well as sequences 
which, when expressed, code for biologically active alphavirus nonstructural proteins 
15 {e.g., nsPl, nsP2, nsP3, nsP4), and an alphavirus RNA polymerase recognition 
sequence (also referred to as 3' CSE, in background). In addition, the vector construct 
should include a viral subgenomic "junction region" promoter which may, in certain 
embodiments, be modified in order to prevent, increase, or reduce viral transcription of 
the subgenomic fragment, and also a polyadenylate tract. The vector also may include 

20 sequences from one or more structural protein genes or portions thereof, extraneous 
nucleic acid molecule(s) which are of a size sufficient to allow production of viable 
virus, a 5* promoter which is capable of initiating the synthesis of viral RNA in vitro 
from cDNA, a heterologous sequence to be expressed, as well as one or more restriction 
sites for insertion of heterologous sequences. 

25 "Alphavirus RNA vector replicon ". " RNA vector replicon " and 

"replicon" refers to a RNA molecule which is capable of directing its own amplification 
or self-replication in vivo, within a target cell. To direct its own amplification, the RNA 
molecule may: 1) encode one or more polymerase, replicase, or other proteins which 
may interact with viral or host cell-derived proteins, nucleic acids or ribonucleoproteins 

30 to catalyze RNA amplification; and 2) contain cis RNA sequences required for 
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replication which may be bound by its self-encoded proteins, or non-self-encoded cell- 
derived proteins, nucleic acids or ribonucleoproteins, or complexes between any of 
these components. In certain embodiments, the amplification also may occur in vitro. 
An alphavirus-derived RNA vector replicon molecule should contain the following 
5 ordered elements: 5* viral sequences required in cis for replication (also referred to as 5* 
CSE, in background), sequences which, when expressed, code for biologically active 
alphavirus nonstructural proteins (e.g., nsPl, nsP2, nsP3, nsP4), 3* viral sequences 
required in cis for replication (also referred Jo as 3* CSE, in background), and a 
polyadenylate tract. The alphavirus-derived RNA vector replicon also may contain a 

10 viral subgenomic "junction region" promoter which may, in certain embodiments, be 
modified in order to prevent, increase, or reduce viral transcription of the subgenomic 
fragment, sequences from one or more structural protein genes or portions thereof, 
extraneous nucleic acid molecule(s) which are of a size sufficient to allow production of 
viable virus, as well as heterologous sequence(s) to be expressed. The source of RNA 

15 vector replicons in a cell may be from infection with a virus or recombinant alphavirus 
particle, or transfection of plasmid DNA or in vitro transcribed RNA. 

" Recombinant Alphavirus Particle " refers to a virion unit containing an 
alphavirus RNA vector replicon. Generally, the recombinant alphavirus particle 
comprises one or more alphavirus structural proteins, a lipid envelope and an RNA 

20 vector replicon. Preferably, the recombinant alphavirus particle contains a nucleocapsid 
structure that is contained within a host cell-derived lipid bilayer, such as a plasma 
membrane, in which alphaviral-encoded envelope glycoproteins are embedded. The 
particle may also contain other components (e.g., targeting elements such as biotin, 
. other viral structural proteins, or other receptor binding ligands) which direct the 

25 tropism of the particle from which the alphavirus was derived, or other RNA molecules. 

" Structural protein expression cassette " refers to a nucleic acid molecule 
which is capable of directing the synthesis of one or more alphavirus structural proteins. 
The expression cassette should include a 5' promoter which is capable of initiating 
in vivo the synthesis of RNA from cDNA, as well as sequences which, when expressed, 

30 code for one or more biologically active alphavirus structural proteins (e.g., C, E3, E2, 
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6K, El), and a 3' sequence which controls transcription termination. The expression 
cassette also may include a 5' sequence which is capable of initiating transcription of an 
alphavirus RNA (also referred to as 5' CSE, in background), a viral subgenomic 
"junction region" promoter, and an alphavirus RNA polymerase recognition sequence 
5 (also referred to as 3' CSE, in background). In certain embodiments, the expression 
cassette also may include splice recognition sequences, a catalytic ribozyme processing 
sequence, a sequence encoding a selectable marker, a nuclear export signal, as well as a 
polyadenylation sequence. In addition, expression of the alphavirus structural protein(s) 
may, in certain embodiments, be regulated by the use of an inducible promoter. 
10 " Stable Transformation " refers to the introduction of a nucleic acid 

molecule into a living cell, and long-term or permanent maintenance of that nucleic acid 
molecule in progeny cells through successive cycles of cell division. The nucleic acid 
molecule may be maintained in any cellular compartment, including, but not limited to, 
the nucleus, mitochondria, or cytoplasm. In preferred embodiments, the nucleic acid 
15 molecule is maintained in the nucleus. Maintenance may be intrachromosomal 
(integrated) or extrachromosomal, as an episomal event. 

" Alphavirus packaging cell line " refers to a cell which contains an 
alphavirus structural protein expression cassette and which produces recombinant 
alphavirus particles after introduction of an alphavirus vector construct, RNA vector 
20 replicon, eukaryotic layered vector initiation system, or recombinant alphavirus particle. 
The parental cell may be of mammalian or non-mammalian origin. Within preferred 
embodiments, the packaging cell line is stably transformed with the structural protein 
expression cassette. 

" Alphavirus producer cell line " refers to a cell line which is capable of 
25 producing recombinant alphavirus particles, comprising an alphavirus packaging cell 
line which also contains an alphavirus vector construct, RNA vector replicon, 
eukaryotic layered vector initiation system, or recombinant alphavirus particle. 
Preferably, the alphavirus vector construct is eukaryotic latered vector initiation system, 
and the producer cell line is stably transfomed with the vector construct. In preferred 
30 embodiments, transcription of the alphavirus vector construct and subsequent 
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production of recombinant alphavinis particles occurs only in response to one or more 
factors, or the differentiation state of the alphavirus producer cell line. 

"Defective helper construct" refers to an assembly which is capable of 
RNA amplification or replication, and expression of one or more alphavirus structural 
5 proteins in response to biologically active alphavirus nonstructural proteins supplied in 
trans. The defective helper construct should contain the following ordered elements: 5* 
viral or defective-interfering RNA sequences required in cis for replication, a viral 
subgenomic junction region promoter, sequences which, when expressed, code for one 
or more biologically active alphavirus structural proteins (e.g., C, E3, E2, 6K, El), 3' 

10 viral sequences required in cis for replication, and a polyadenylate tract. The defective 
helper construct also may contain a 5* promoter which is capable of initiating the 
synthesis of viral RNA from cDNA, a 3' sequence which controls transcription 
termination, splice recognition sequences, a catalytic ribozyme processing sequence, a 
sequence encoding a selectable marker, and a nuclear export signal. 

15 "Eukarvotic Layered V ector Initiation System " refers to an assembly 

which is capable of directing the expression of a sequence(s) or gene(s) of interest. Thfe 
eukaryotic layered vector initiation system should contain a 5' promoter which is 
capable of initiating in vivo (i.e. within a cell) the synthesis of RNA from cDNA, and a 
nucleic acid vector sequence which is capable of directing its own replication in a 

20 eukaryotic cell and also expressing a heterologous sequence. The nucleic acid sequence 
which is capable of directing its own amplification may be of viral or non-viral origin. 
In certain embodiments, the nucleic acid vector sequence is an alphavirus-derived 
sequence and is comprised of a 5' sequence which is capable of initiating transcription 
. of an alphavirus RNA (also referred to as 5' CSE, in background), as well as sequences 

25 which, when expressed, code for biologically active alphavirus nonstructural proteins 
(e.g., nsPl, nsP2, nsP3, nsP4), and an alphavirus RNA polymerase recognition 
sequence (also referred to as 3' CSE, in background). In addition, the vector sequence 
may include a viral subgenomic "junction region" promoter which may, in certain 
embodiments, be modified in order to prevent, increase, or reduce viral transcription of 

30 the subgenomic fragment, sequences from one or more structural protein genes or 
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portions thereof, extraneous nucleic acid molecule(s) which are of a size sufficient to 
allow optimal amplification, a heterologous sequence to be expressed, one or more 
restriction sites for insertion of heterologous sequences, as well as a polyadenylation 
sequence. The eukaryotic layered vector initiation system may also contain splice 
5 recognition sequences, a catalytic ribozyme processing sequence, a nuclear export 
signal, and a transcription termination sequence. In certain embodiments, in vivo 
synthesis of the vector nucleic acid sequence from cDNA may be regulated by the use 
of an inducible promoter. 

" Alphavirus cDNA vector construct " refers to an assembly which is 

10 capable of directing the expression of a sequence(s) or gene(s) of interest. The vector 
construct is comprised of a 5' sequence which is capable of initiating transcription of an 
alphavirus RNA (also referred to as 5' CSE), as well as sequences which, when 
expressed, code for biologically active alphavirus nonstructural proteins (e.g., nsPl, 
nsP2, nsP3, nsP4), and an alphavirus RNA polymerase recognition sequence (also 

1 5 referred to as 3' CSE, in background). In addition, the vector construct should include a 
5' promoter which is capable of initiating in vivo the synthesis of viral RNA from 
cDNA, and a 3' sequence which controls transcription termination. Within certain 
embodiments, the vector construct may further comprise a viral subgenomic "junction 
region" promoter which may, in certain embodiments, be modified in order to prevent, 

20 increase, or reduce viral transcription of the subgenomic fragment The vector also may 
include sequences from one or more structural protein genes or portions thereof, 
extraneous nucleic acid molecule(s) which are of a size sufficient to allow production of 
viable virus, a heterologous sequence to be expressed, one or more restriction sites for 
insertion of heterologous sequences, splice recognition sequences, a catalytic ribozyme 

25 processing sequence, a nuclear export signal, as well as a polyadenylation sequence. In 
certain embodiments, in vivo synthesis of viral RNA from cDNA may be regulated by 
the use of an inducible promoter. 

" Gene delivery vehicle " refers to a construct which can be utilized to 
deliver a gene or sequence of interest. Representative examples include alphavirus 
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RNA vector replicons, alphavirus vector constructs, eukaryotic layered vector initiation 
systems and recombinant alphavirus particles. 

Numerous aspects and advantages of the invention will be apparent to 
those skilled in the art upon consideration of the following detailed description which 
5 provides illumination of the practice of the invention. 

Detailed Description of the Invention 

As noted above, the present invention provides novel gene delivery 
vehicles including for example, RNA vector replicons, alphavirus vector constructs, 

10 eukaryotic layered vector initiation systems and recombinant alphavirus particles. 
Briefly, introduction of plasmid DNA-, in vitro transcribed RNA-, or particle-based 
vectors of the present invention into a cell results in levels of heterologous gene 
expression that are equivalent, or higher, as compared to expression levels of wild-type 
derived alphaviral vectors. Unexpectedly however, the level of vector-specific RNA 

15 synthesized is at least about 5 to 10-fold lower in cultured cells which contain a gene 
delivery vehicle of the present invention, as compared to wild-type derived vectors. 
Furthermore, such gene delivery vehicles exhibit reduced, delayed, or no inhibition of 
host cell-directed macromolecular synthesis following introduction into a host celh as 
compared to wild-type derived vectors. 

20 As discussed in more detail below, the present invention provides: 

(A) sources of wild-type alphaviruses suitable for constructing the gene delivery 
vehicles of the present invention; (B) methods for selecting alphaviruses with a desired 
phenotype; (C) construction of alphavirus vector constructs and alphavirus RNA vector 
- replicons; (D) construction of Eukaryotic Layered Vector Initiation Systems; 

25 (E) construction of recombinant alphavirus particles; (F) heterologous sequences which 
may be expressed by the gene delivery vehicles of the present invention; 
(G) construction of alphavirus packaging or producer cell lines; (H) pharmaceutical 
compositions; and (I) methods for utilizing alphavirus-based vectors. 
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A. Sources of Wild-Type Alphavirus 

As noted above, the present invention provides a wide variety of 
alphavirus-based vectors (e.g., RNA vector replicons, alphavirus vector constructs, 
eukaryotic layered vector initiation systems and recombinant alphavirus particles), as 
5 well as methods for utilizing such vector constructs and particles. Briefly, sequences 
encoding wild-type alphaviruses suitable for use in preparing the above-described 
vectors can be readily obtained given the disclosure provided herein from naturally- 
occurring sources, or from depositories (e.g., the American Type Culture Collection, 
Rockville, Maryland). In addition, wild-type alphaviruses may be utilized for 
10 comparing the level of host-cell directed macromolecular synthesis in cells infected 
with the wild-type alphavirus, with the level of host-cell directed macromolecular 
synthesis in cells containing the gene delivery vehicles of the present invention. 

Representative examples of suitable alphaviruses include Aura virus 
(ATCC VR-368), Bebaru virus (ATCC VR-600, ATCC VR-1240), Cabassou virus 
1 5 (ATCC VR-922), Chikungunya virus (ATCC VR-64, ATCC VR- 1 24 1 ), Eastern equine 
encephalomyelitis virus (ATCC VR-65, ATCC VR-1242), Fort Morgan virus (ATCC 
VR-924), Getah virus (ATCC VR-369, ATCC VR-1243), Kyzylagach virus (ATCC 
VR-927), Mayaro virus (ATCC VR-66, ATCC VR-1277), Middleburg virus (ATCC 
VR-370), Mucambo virus (ATCC VR-580, ATCC VR-1244), Ndumu virus (ATCC 
20 VR-371), Pixuna virus (ATCC VR-372, ATCC VR-1245), Ross River virus (ATCC 
VR-373, ATCC VR-1246), Semliki Forest virus (ATCC VR-67, ATCC VR-1247), 
Sindbis virus (ATCC VR-68, ATCC VR-1248; see also CMCC #4640, described 
below), Tonate virus (ATCC VR-925), Triniti virus (ATCC VR-469), Una virus 
(ATCC VR-374), Venezuelan equine encephalomyelitis virus (ATCC VR-69, ATCC 
25 VR-923, ATCC VR-1250 ATCC VR-1249, ATCC VR-532), Western equine 
encephalomyelitis virus (ATCC VR-70, ATCC VR-1251, ATCC VR-622, ATCC VR- 
1252), Whataroa virus (ATCC VR-926), and Y-62-33 virus (ATCC VR-375). 

For purposes of comparing levels of cellular macromolecular synthesis, 
the following plasmids may also be utilized as a standard source of wild-type alphavirus 
30 stocks. These plasmids include: for Semliki Forest Virus, pSP6-SFV4 (Liljestrom et 
al., J. Virol. 65:4107-4113, 1991); for Venezuelan equine encephalitis virus, pV2000 
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(Davis et al., Vir. 183:20-31, 1991); for Ross River virus, pRR64 (Kuhn et al., Vir. 
/<S2:430-441, 1991). Briefly, for these plasmids, virus can be obtained from BHK cells 
transfected with in vitro transcribed genomic RNA from the plasmids. For Sindbis 
virus, infectious virus may be isolated directly from BHK cells transfected with 
5 pVGELVIS (ATCC No. 75891) plasmid DNA, or alternatively, obtained as a wild-type 
virus stock (see deposit information provided below regarding ATCC No. VR-2526). 

B. Selection of Abhaviruses With a Desired Phenotvpe 

The duration of in vivo heterologous gene expression from alphavirus- 

10 based vectors is affected by several mechanisms, including inhibition of host cell- 
directed macromolecular synthesis. However, prior to the present invention, there had 
been no obvious method to select for or identify coding or non-coding vector viral- 
specific sequence changes that result in a non-cytopathic phenotype. Therefore, within 
one aspect of the present invention methods are provided for isolating and/or 

15 constructing alphavirus-derived gene delivery vehicles with reduced or no inhibition of 
host cell directed macromolecular synthesis. 

1. Biological Selection of Virus Variants 

a. Selection from Virus Stocks Containing PI Particles 
20 One approach for isolating non-cytopathic alphavirus variants exploits 

the presence of defective interfering (Dl) particles in wild-type virus preparations. 
Briefly, although certain RNA viruses, for example rhabdoviruses (e.g., vesicular 
stomatitis virus) and alphaviruses (e.g., Sindbis virus and Semliki Forest virus), are 
. highly cytopathic, they can nevertheless establish long-term persistent infection in 
25 cultured cells in the presence of Dl particles. DI particles, by definition, are derived 
from wild-type virus and contain one or more mutations (e.g., deletions, 
rearrangements, nucleotide substitutions, etc.) from the wild-type genome which 
prevent autonomous replication by the DL In general, the genome of Dl particles is 
smaller and of a lower complexity compared to wild-type virus, and is deleted of 
30 protein-encoding regions while maintaining regions required in cis for replication. Such 
cis sequences often are duplicated and/or rearranged. In the case of certain alphaviruses 
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(e.g., Sindbis vims), the sequence and organization of DI RNA genomes have been 
analyzed and found to contain a minimum of 50 nt from the extreme 3'-end of the wild- 
type virus genome, and at their 5'-ends, either a wild-type sequence or a cellular tRNA 
(e.g., tRNA**) sequence, in addition to the viral sequence. In all cases, the propagation 
5 and maintenance of the mutated DI genomes requires the co-existence of parental helper 
virus in the infected cell. However, as a result of their genetic structure, DI genome 
replication is vastly superior and comparatively abundant to its wild-type counterpart. 
This characteristic results in interference of wild-type genome replication, the absence 
or low level production of infectious virus, and the establishment of long-term 
1 0 persistent infection of cells. 

Therefore, as described below in Examples 1 and 2, the ability to 
establish long-term persistent infection in permissive cells (e.g.. mammalian cells, 
including cells of human origin) by infecting with a mixed alphavirus stock containing a 
population of DI particles provides a mechanism to isolate, over time, fully intact virus 
15 variants that are able to establish persistent infection, even in the absence of DI 
particles. Such infectious virus variants can be isolated from long-term persistently 
infected cultures by multiple rounds of plaque purification and have been found to 
initiate productive, persistent, and non-cytopathic infection in the host cells. 
Furthermore, the level of variant virus produced from such a productive, persistent, and 
20 non-cytopathic infection is indistinguishable from wild-type virus infection. This 
observation is in distinct contrast to the previous requirement for establishment of 
persistent infections with virus stocks containing a mixture of DI particles. 

b. Selection from Virus Stocks Not Containing DI Particles 
25 In addition to selection from virus stocks that contain defective- 

interfering particles, virus variants suitable for use within the present invention may be 
obtained from purified virus stocks (without DI particles) which are either subjected to 
random mutagenesis prior to infection of susceptible cultured cells or allowed to 
generate non-specific mutations during RNA replication with the cultured cells. 
30 Briefly, the initial virus stock may be obtained as a natural isolate or biological variant 
derived therefrom, or may be generated by transfecting cultured cells with an infectious 



WO 97/58087 



PCI7US97/06010 



26 

nucleic acid molecule comprising a genomic cDNA clone or in vitro transcribed RNA. 
If desired, the virus stock may then subjected to physical or chemical mutagenesis 
(although preferred, such mutagenesis is not required). In the case of chemical 
mutagenesis, preferred embodiments utilize a readily available mutagenic agent, for 
5 example nitrous acid, 5-azacytidine, N-methyl-N'-nitro-N-nitrosoguanidine, or 
ethylmethane sulfonate (Sigma, St. Louis, MO), prior to virus infection. Following 
random mutagenesis, specific selection procedures are applied to isolate virus variants 
possessing the desired phenotype, as described in more detail below in Example 2. 

10 2. Genetic Selection of Virus Variants 

In a related approach, mutations may be obtained not using a virus stock, 
but rather, using cloned genomic cDNA of the virus that can be used subsequently to 
transcribe infectious viral RNA in vitro (for example, Sindbis virus (Rice et al, J. Virol 
67:3809-3819, 1987; Dubensky et al., J. Virol 70:508-519, 1996, SFV (Liljestrom et 

15 aL J. Virol 65:4107-4113, 1991, VEE (Davis et al., Virology /<5?J:20-31, 1991), Ross 
River virus (Kuhn et al., Virology 752:430-441, 1991), poliovirus (Van Der Werf et ah, 
Proc. Natl Acad. Sci. USA #5:2330-2334, 1986)) or in vivo (Sindbis virus (Dubensky et 
al., ibid), poliovirus (Racaniello and Baltimore, Science 2/4:916-919, 1981)). Briefly, 
the infectious nucleic acid is introduced into susceptible cultured cells (e.g.. mammalian 

20 cells, including cells of human origin) either directly or following mutagenesis 
performed using one of the above-referenced methods. Alternatively, vector nucleic 
acid may be packaged into particles initially, and the particles used to deliver vector 
into the target cell population for selection. Subsequently, specific selection procedures 
. to isolate vims variants possessing the desired phenotype are applied, and are described 

25 below. 

In certain embodiments, random mutagenesis may be performed initially 
by propagation of the plasmid containing viral cDNA in the XL 1 -Red strain of E. coli 
(Stratagene, San Diego, CA), which is deficient in three of the primary DNA repair 
pathways, resulting from rnutS, mutD, and mutT mutations. However, other 
30 mutagenesis procedures including, but not limited to, linker-scanning mutagenesis 
(Haltiner et al., Nucleic Acids Res. 75:1015, 1985; Barany, Proc Natl Acad. ScL USA 
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52:4202, 1985), random oligonucleotide-directed mutagenesis (Kunkel et al.. Methods 
Enzymol 755:166, 1987; Zoller and Smith, Methods Enzymol 154:329, 1987; Hill 
etal., Methods Enzymol. 75.5:558, 1987; Hermes et al., Gene 84:143, 1989) and PCR 
mutagenesis (Herlitze and Koenen, Gene P7:143, 1990), can be readily substituted 
5 utilizing published protocols. The resulting mixed population of mutated cDNA clones 
is introduced into susceptible cultured cells directly, or after transcription in vitro. 
Enrichment for transfected cells which contain mutated virus of the desired phenotype 
is accomplished based on increased survival time over wild-type virus infected cells, as 
described below. 

10 

3. Genetic Selection of Variants Using Virus-Derived Vectors 

In another approach, mutations may be generated in any region a virus- 
derived expression vector, including the regulatory, untranslated regions, or protein- 
encoding gene regions. For example, within one aspect of the invention methods are 

15 provided for selecting viral variants with reduced or no inhibition of host-cell directed 
macromolecular synthesis, comprising the steps of: (a) introducing into a cell a 
eukaryotic layered vector initiation system, RNA vector replicon, or recombinant 
alphavirus particle which directs the expression of a immunogenic cell surface protein 
(suitable for detection of vector containing cells), or alternatively, a selectable marker 

20 (either a drug or non-drug marker wherein non-vector containing cells are killed upon 
addition of, for example, a drug such as neomycin, hygromycin, phleomycin, gpt, 
puromycin, or histidinol); (b) incubation or culturing the cells under conditions and for 
a time sufficient to select vector containing cells which exhibit the desired phenotype; 
followed by (c) isolating cells which contain the vector of the desired phenotype and (d) 

25 analysis of the vector for the causal mutation. 

As noted above, the viral vectors of the present invention may be derived 
from a wide variety of viruses (e.g., Sindbis virus (Xiong et al., Science 243:\ 188-1 191, 
1989; HahnetaL, Proc Natl. Acad Sci. USA 59:2679-2683, 1992; Schlesinger, Trends 
Biotechnol 77:18-22, 1993; Dubensky et al., ibid), Semliki Forest virus (Liljestrom 

30 and GarofE Bio/Technology 9:1356-1361, 1991), Venezuelan equine encephalitis virus 
(Davis et al., J. Cell. Biochem. Suppl. 79,4:310, 1995), poliovirus (Choi et al., J. Virol 
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65:2875-2883, 1991; Ansardi et al., Cancer Res. 54:6359-6364, 1994; and Andino et 
al., Science 265:1448-1451, 1994). Representative examples of the above-described 
methods are discussed in more detail below within Example 2. 

5 4. Use of Viral Variants 

As discussed in more detail herein, viral variants which have been 
selected or generated utilizing the methods provided herein may be utilized to construct 
a wide variety of recombinant gene delivery vehicles which exhibit the desired 
phenotype. Within certain embodiments, the gene delivery vehicle contains a mutation 

10 within the Leu-Xaa-Pro-Gly-Gly ("LXPGG") motif of the nsP2 gene. Briefly, for 
alphaviruses wherein published sequence of the nsP2 gene is available, a highly 
conserved amino acid motif -Leu-Xaa-Pro-Gly-Gly- ("LXPGG") is observed. As 
predicted by standard protein modelling algorithms (Chou and Fasman, Adv. Ertzym. 
47:45-148, 1978), the residues of this motif possibly comprise a (3 turn in the structure. 

1 5 Proline 726 of nsP2 in Sindbis virus is the central residue of this motif. The 
corresponding motif in other alphaviruses is illustrated in the table below. 

Alphavirus Strain* Pro-Glv-Glv Region nsP2 a.a.'s (P-G-G> 

1 Sindbis virus Leu-Asn-Pro-GIy-Gly-Thr a.a. = 726-728 

2. S.A.AR86 virus Leu-Asn-Pro-GIy-Gly-Thr a.a. = 726-728 

3. Ockelbo virus Leu-Asn-Pro-GIy-Gly-Thr a.a. = 726-728 

4. Aura virus Lcu-Lys-Pro-GIy-Gly-Thr a.a. = 725-727 

5. Semliki Forest virus Leu-Lys-Pro-Gly-Gly-Ile a.a. = 71 8-720 

6. VEE virus Leu-Asn-Pro-GIy-Gly-Thr a.a. = 713-715 

7. Ross River virus Leu-Xaa-Pro-Gly-Gly-Ser a.a. - 7 1 7-7 1 9 

^ * Alphavirus strains with published nsP2 sequences: (1) Strauss et al., Virology 133:92- 
20 1 10, 1984; (2) Simpson et al., Virology 222:464-469 1996; (3) Shirako et al., Virology 
752:753-764, 1991; (4) Rumenapf et al., Virology 205:621-633, 1995; (5) Takkinen, 
Nucleic Acids Res. 74:5667-5682, 1986; (6) Kinney et al., Virology 170:19-30, 1989; 
and (7) Faragher et al, Virology 163:509-526, 1988. 

25 Hence, within various embodiments of the present invention, gene 

delivery vehicles are provided wherein the gene delivery vehicle contains an nsP2 gene 
with a mutation in the LXPGG motiff. Within one embodiment, the Leu codon is 
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mutated to another amino acid selected from the group consisting of Ala, Arg ? Asn, 
Asp, Asx, Cys, Gin, Glu, Glx, Gly, His, He, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, Val, 
or another rare or non-protein amino acid (see, e.g., Lehninger, Biochemistry, Worth 
Publishers, Inc., N.Y. N.Y., 1975). Within another embodiment, the Pro codon is 
5 mutated to another amino acid selected from the group consisting of Ala, Arg, Asn, 
Asp, Asx, Cys, Gin, Glu, Glx, Gly, His, He, Leu, Lys, Met, Phe, Ser, Thr, Trp, Tyr, Val, 
or another rare or non-protein amino acid. Within other embodiments, either or both of 
the Gly codons may be mutated to another amino acid selected from the group 
consisting of Ala, Arg, Asn, Asp, Asx, Cys, Gin, Glu, Glx, His, He, Leu, Lys, Met, Phe, 

1 0 Pro, Ser, Thr, Trp, Tyr, Val, or another rare or non-protein amino acid. Within yet other 
embodiments, the Xaa amino acid, or amino acids between 1 and 3 residues upstream or 
downstream of the LXPGG motiff may be mutated from the wild-type amino acid in 
order to effect the phenotype of the resultant gene delivery vehicle. Within certain 
embodiments of the invention, the LXPGG motiff may be mutated to contain more than 

1 5 one codon alteration, or alternatively, one or more codon insertions or deletions. 

» 

C. Alphavirus Vector Constructs and Alphavirus RNA Vector Replicons 

As noted above, the present invention provides both DNA and RNA 
constructs which are derived from alphaviruses. Briefly, within one aspect of the 

20 present invention alphavirus vector constructs are provided, comprising a 5 f promoter 
which initiates synthesis of viral RNA in vitro from cDNA, a 5' sequence which 
initiates transcription of alphavirus RNA, a nucleic acid molecule which operably 
encodes all four alphaviral nonstructural proteins including an isolated nucleic acid 
molecule as described above, an alphavirus RNA polymerase recognition sequence and 

25 a 3 1 polyadenylate tract. Within other aspects, RNA vector replicons are provided, 
comprising a 5' sequence which initiates transcription of alphavirus RNA, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, including 
the isolated nucleic acid molecules discussed above, an alphavirus RNA polymerase 
recognition sequence and a 3' polyadenylate tract. Within preferred embodiments of the 

30 above, the above constructs further comprise a viral junction region. Each of these 
aspects are discussed in more detail below. 
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1- 5' Promoters which initiate synthesis of viral RNA 

As noted above, within certain embodiments of the invention, alphavirus 
vector constructs are provided which contain 5" promoters which (e.g., DNA dependent 
5 RNA polymerase promoters) initiate synthesis of viral RNA from cDNA by a process 
of in vitro transription. Within preferred embodiments such promoters include, for 
example, the bacteriophage T7, T3, and SP6 RNA polymerase promoters. Similarly, 
eukarytoic layered vector initiation systems are provided (e.g., DNA dependent RNA 
polymerase promoters) which contain 5* promoters which initiate synthesis of viral 

10 RNA from cDNA in vivo (/.<?., within a cell). Within certain embodiments, such RNA 
polymerase promoters (for either alphavirus vector constructs or eukaryotic layered 
vector initiation systems) may be derived from both prokaryotic and eukaryotic 
organisms, and include, for example, the bacterial p-galactosidase and trpE promoters, 
and the eukaryotic viral simian virus 40 (SV40) (e.g., early or late), cytomegalovirus 

15 (CMV) (e.g., immediate early), Moloney murine leukemia virus (MoMLV) or Rous 
sarcoma virus (RSV) LTR, and herpes simplex virus (HSV) (thymidine kinasfe) 
promoters. 

2. Sequences Which Initiate Transcription 

20 As noted above, within preferred embodiments the alphavirus vector 

constructs and RNA vector replicons of the present invention contain a 5' sequence 
which is capable of initiating transcription of an alphavirus RNA (also referred to as 5- 
end CSE, or 5 r cis replication sequence). Representative examples of such sequences 
. include nucleotides 1-60, and to a lesser extent nucleotides through bases 150-210, of 

25 the wild-type Sindbis virus, nucleotides 10-75 for tRNA Asp (aspartic acid, Schlesinger 
etaL, U.S. Patent No. 5,091,309), and 5* sequences from other alphaviruses which 
initiate transcription. It is the complement of these sequences, which corresponds to the 
3' end of the of the minus-strand genomic copy, which are bound by the nsP replicase 
complex, and possibly additional host cell factors, from which transcription of the 

30 positive-strand genomic RNA is initiated. 
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3. Alphavirus Nonstructural Proteins 

The alphavirus vector constructs and RNA vector replicons provided 
herein also require sequences encoding all four alphaviral nonstructural proteins, 
5 including a sequence which provides the desired phenotype discussed above. Briefly, a 
wide variety of sequences which encode alphavirus nonstructural proteins, in addition to 
those explicitly provided herein, may be utilized in the present invention, and are 
therefore deemed to fall within the scope of the phrase "alphavirus nonstructural 
proteins." For example, due to the degeneracy of the genetic code, more than one codon 
10 may code for a given amino acid. Therefore, a wide variety of nucleic acid sequences 
which encode alphavirus nonstructural proteins may be generated. Furthermore, amino 
acid substitutions, additions, or deletions at any of numerous positions may still provide 
functional or biologically active nonstructural proteins. Within the context of the 
present invention, alphavirus nonstructural proteins are deemed to be biologically active 
15 if they promote self-replication of the vector construct, i.e., replication of viral nucleic 
acids and not necessarily the production of infectious virus, and may be readily 
determined by metabolic labeling or RNase protection assays performed over a time 
course. Methods for making such derivatives are readily accomplished by one of 
ordinary skill in the art given the disclosure provided herein. 
20 Alphaviruses express four nonstructural proteins, designated nspl, nsp2, 

nsp3, and nsp4. Vectors of the present invention derived from alphaviruses should 
contain sequences encoding the four nonstructural proteins. In wild-type Sindbis virus, 
nonstructural proteins 1-3 are encoded by nucleotides 60 to 5747, while nsP4 is encoded 
_ by nucleotides 5769 to 7598 (see Figure 1). The nonstructural proteins are translated 
25 from the genomic positive strand RNA as one of two large polyproteins, known as PI 23 
or PI 234, respectively, depending upon (i) whether there is an opal termination codon 
between the coding regions of nsP3 and nsP4 and (ii) if there is such an opal codon 
present, whether there is translation termination of the nascent polypeptide at that point 
or readthrough and hence production of P1234. The opal termination codon is present 
30 at the nsP3/nsP4 junction of the alphaviruses SIN (strain AR339 and the SIN-1 strain 
described herein), AURA, WEE, EEE, VEE, and RR, and thus the PI 23 and PI 234 
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species are expressed in cells infected with these viruses. In contrast, no termination 
codon is present at the nsP3/nsP4 junction of the alphaviruses SIN (strain AR86, SF, 
and ONN), and thus only the PI 234 species is expressed in cells infected with these 
viruses. Both the polyprotein and processed monomeric forms of the nonstructural 
5 proteins function in the replication of the alphavirus RNA genome. Experiments 
examining growth characteristics of alphavirus nonstructural protein cleavage mutants 
have indicated that the polyproteins arc involved in the synthesis of the genomic 
negative stranded RNA, while the individual monomeric proteins catalyze the synthesis 
of the genomic and subgenomic positive stranded RNA species (Shirako and Strauss* 1 

10 Virol 65:1874-1885, 1994). Translational readthrough generally occurs about 10%- 
20% of the time in cells infected with wild type Sindbis virus containing the opal 
termination codon at the nsP3/nsP4 junction. Processing of PI 23 and PI 234 is by a 
proteinase activity encoded by the one of the nonstructural proteins, and is discussed 
further below. The order of processing, whether in cis or in trans, depends on various 

15 factors, including the stage of infection. For example, Sindbis virus and SFV produce 
P123 and nsp4 early in infection, and PI 2 and P34 later in infection. Further processing 
then releases the individual nonstructural proteins. Each nonstructural protein has 
several functions, some of which are described below. 

20 a. nsPl 

Nonstructural protein 1 is required for the initiation of (or continuation 
of) minus-strand RNA synthesis. It also plays a role in capping the 5* terminus of 
genomic and subgenomic alphavirus RNAs during transcription, as nsPl possesses both 
methyltransferase (Mi and Stollar, Vir. 754:423-427, 1991) and guanyltransferase 

25 activity (Strauss and Strauss, Microbiol Rev. 55(3):49 1-562, 1994). NsPl also 
modulates the proteinase activity of nsP2, as polyproteins containing nsPl inefficiently 
cleave between nsP2 and nsP3 (de Groot et al., EMBOl 9:2631-2638, 1990). 



30 



b. nsP2 

Nonstructural protein 2 is a multifunctional protein, involved in the 
replication of the viral RNA and processing of the nonstructural polyprotein. The N- 
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terminal domain of the protein (spanning about the first 460 amino acids) is believed to 
be a helicase which is active in duplex unwinding during RNA replication and 
transcription. Synthesis of 26S subgenomic mRNA, which, in vectors according to the 
present invention, encodes the gene(s) of interest, requires functional nsP2. The C- 
5 terminal domain of nsP2, between amino acid residues 460-807 of Sindbis virus, 
proteolytic^ ly cleaves in trans and in cis the nonstructural polyprotein between the 
nsPl/nsP2, nsP2/nsP3, and nsP3/nsP4 junctions. Alignment of the primary sequences 
of the alphavirus nsP2 C-terminal domains suggests that nsP2 is a papain-like 
proteinase (Hardy and Strauss, J. Virol 65:4653-4664, 1988). 

J 0 Other observed characteristics of nsP2 have not, as yet, been assigned a 

function directly related to the propagation of alphaviruses. For example, it has been 
shown that nsP2 is closely associated with ribosomes in SFV-infected cells, and can be 
cross-linked to rRNA by UV irradiation (Ranki et al., FEBS Lett, 705:299-302, 1979), 
Further, 50% of nsP2 is localized in the nuclear matrix, particularly in the area of the 

15 nucleoli of SFV-infected BHK cells (Peranen et al., J. Virol 64:1888-1896, 1990). 
Localization of nsP2 to the nuclei presumably proceeds by active transport, as it 
exceeds the size of small proteins and metabolites (about 20-60 kD), which can enter 
the nucleus by diffusion through nuclear core complexes (Paine et al.. Nature 254:109- 
1 14, 1975). Putative NLS sequences have been identified in the alphaviruses SFV, SIN, 

20 RR, ONN, OCK, and VEE (Rikkonen et al., Vir. 759:462-473, 1 992). 

c. nsP3 

Nonstructural protein nsP3 contains two distinct domains, although their 
precise roles in viral replication are not well understood. The N-terminal domain ranges 

25 in length from 322 to 329 residues in different alphaviruses and exhibits a minimum of 
51% amino acid sequence identity among any two alphaviruses. The C-terminal 
domain, however, is not conserved among known alphaviruses in length or in sequence, 
and multiple changes are tolerated (Li et al., Virology, 779:416-427). The protein is 
found associated with replication complexes in a heavily phoshorylated state. In 

30 alphaviruses whose genomes contain an opal termination codon between the nsP3/nsP4 
junction, two different proteins are produced depending upon whether or not there is 
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readthrough of the opal termination signal. Readthrough results in an nsP3 protein 
which contains 7 additional carboxy terminal amino acids after cleavage of the 
polyprotein. It is clear that nsP3 is required in some capacity for viral RNA synthesis, 
as particular mutants of this protein are RNA negative, and the PI 23 polyprotein is 
5 required for minus-strand RNA synthesis. 

d. nsP4 

NsP4 is the virus-encoded RNA polymerase and contains the GDD motif 
characteristic of such enzymes (Kamer and Argos, Nucleic Acids Res. 72:7269-7282, 

10 1984). Thus, nsP4 is indispensable for alphavirus RNA replication. The concentration 
of nsP4 is tightly regulated in infected cells. In most alphaviruses, translation of nsP4 
requires readthrough of an opal codon between the nsP3 and nsP4 coding regions, 
resulting in lower intracellular levels as compared to other nonstructural proteins. 
Additionally, the bulk of nsP4 is metabolically unstable, through degradation by the N- 

15 end rule pathway (Gonda et al., J. Biol. Chem. 204:16700- 1 67 12, 1989). However, 
some nsP4 is stable, due to its association with replication complexes which conceal 
degradation signals. Thus, stabilization of the enzyme by altering the amino terminal 
residue may prove useful in promoting more long term expression of proteins encoded 
by the vectors described herein. Stabilizing amino terminal residues include 

20 methionine, alanine, and tyrosine. 

4. Viral Junction Regions 

The alphavirus viral junction region normally controls transcription 
. initiation of the subgenomic mRNA; thus, this element is also referred to as the 
25 subgenomic mRNA promoter. In the case of Sindbis virus, the normal viral junction 
region typically begins at approximately nucleotide number 7579 and continues through 
at least nucleotide number 7612 (and possibly beyond). At a minimum, nucleotides 
7579 to 7602 (5'- ATC TCT ACG GTG GTC CTA AAT AGT - SEQ. ID NO. 1) are 
believed necessary for transcription of the subgenomic fragment. This region 
30 (nucleotides 7579 to 7602) is hereinafter referred to as the "minimal junction region 
core." 
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Within certain aspects of the invention, the viral junction region is 
inactivated in order to prevent synthesis of the subgenomic fragment. As utilized 
within the context of the present invention, "inactivated" means that the species 
corresponding to subgenomic mRNA is not observed in autoradiograms from 
5 denaturing gels of electrophoresed RNA purified from cells containing these vectors 
and treated with 1 ng/ml dactinomycin and labeled with [ 3 H]-uridine, as described 
(Frolov and Schlesinger, 7. Virol 65:1721-1727, 1994). 

Within one embodiment of the invention, gene delivery vehicles may be 
constructed by the placement of signals promoting either ribosome readthrough or 
10 internal ribosome entry immediately downstream of the disabled junction region 
promoter. In this vector configuration, synthesis of subgenomic message cannot occur; 
however, the heterologous proteins are expressed from genomic length mRNA by either 
ribosomal readthrough (scanning) or internal ribosome entry. 

In certain applications of the gene delivery vehicles described herein, the 
15 expression of more than one heterologous gene is desired. For example, in order to 
treat metabolic disorders such as Gaucher's syndrome, multiple administrations of gene 
delivery vehicles or particles may be required, since duration of the therapeutic 
palliative may be limited. Therefore, within certain embodiments of the invention it 
may be desirable to co-express in a target cell the Adenovirus 2 E3 gene, along with a 
20 therapeutic palliative, such as the glucocerebrosidase gene. In wild-type virus, the 
structural protein (sP) polycistronic message is translated into a single polyprotein 
which is processed subsequently into individual proteins in part by the sP capsid 
proteinase. Thus, expression of multiple heterologous genes from a polycistronic 
message requires a mechanism different from the wild-type virus, since the protease 
25 activity of the capsid sP, or the peptides recognized for cleavage, are not present in the 
replacement region of the alphavirus vectors. Therefore, within further embodiments of 
the invention, functional elements which permit translation of multiple independent 
heterologous sequences, including ribosomal readthrough, cap-independent translation, 
internal ribosome entry, or minimal junction region core sequences, can be utilized. 

30 



WO 97/38087 



36 



PCT/US97/06010 



5- Alphavirus RNA polymerase recogni tion sequence, and poly(A) trart 

As noted above, alphavirus vector constructs or RNA vector replicons of 
the present invention also should include an alphavirus RNA polymerase recognition 
sequence (also termed "alphavirus replicase recognition sequence", "3' terminal CSE". 
5 or H 3' cis replication sequence"). Briefly, the alphavirus RNA polymerase recognition 
sequence, which is located at the 3' end region of positive stranded genomic RNA, 
provides a recognition site at which the virus begins replication by synthesis of the 
negative strand. A wide variety of sequences, may be utilized as an alphavirus RNA 
polymerase recognition sequence. For example, within one embodiment, Sindbis virus 

1 0 vector constructs in which the polymerase recognition is truncated to the smallest region 
that can still function as a recognition sequence (e.g., nucleotides 1 1,684 to 1 1,703) can 
be utilized. Within another embodiment of the invention, Sindbis virus vector 
constructs in which the entire nontranslated region downstream from the El sP gene to 
the 3' end of the viral genome including the polymerase recognition site (e.g., 

1 5 nucleotides 1 1 ,382 to 1 1 ,703), can be utilized. 

Within preferred embodiments of the invention, the alphavirus vect6r 
construct or RNA vector replicon may additionally contain a poly(A) tract, which 
increases dramatically the observed level of heterologous gene expression in cells 
transfected with alphavirus-derived vectors (see e.g., Dubensky et al, supra). Briefly, 

20 the poIy(A) tract may be of any size which is sufficient to promote stability in the 
cytoplasm, thereby increasing the efficiency of initiating the viral life cycle. Within 
various embodiments of the invention, the poly(A) sequence comprises at least 10 
adenosine nucleotides, and most preferably, at least 25 adenosine nucleotides. Within 
- one embodiment, the poly(A) sequence is attached directly to Sindbis virus nucleotide 

25 11,703. 

D. Eukarvotic Layered Vector Initiation Systems 

Due to the size of a full-length genomic alphavirus cDNA clone, in vitro 
transcription of full-length, capped RNA molecules is rather inefficient. This results in 
30 a lowered transfection efficiency, in terms of infectious centers of virus (as measured by 
plaque formation), relative to the amount of in vitro transcribed RNA transfected. Such 
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inefficiency is also relevant to the in vitro transcription of alphavirus expression 
vectors. Testing of candidate cDNA clones and other alphavirus cDNA expression 
vectors for their ability to initiate an infectious cycle or to direct the expression of a 
heterologous sequence can thus be greatly facilitated if a cDNA clone is transfected into 
5 susceptible cells as a DNA molecule, which then directs the synthesis of viral RNA 
in vivo. 

Therefore, within one aspect of the present invention DNA-based vectors 
(referred to as "Eukaryotic Layered Vector Initiation Systems") are provided that are 
capable of directing the synthesis of viral RNA (genomic or vector) in vivo. Generally, 
10 eukaryotic layered vector initiation systems comprise a 5' promoter that is capable of 
initiating in vivo (i.e., within a cell) the 5' synthesis of RNA from cDNA, a construct 
that is capable of directing its own replication in a cell, the construct also being capable 
of expressing a heterologous nucleic acid sequence, and a 3' sequence that controls 
transcription termination (e.g., a polyadenylate tract). Such eukaryotic layered vector 

15 initiation systems provide a two-stage or "layered" mechanism that controls expression 
of heterologous nucleotide sequences. Briefly, the first layer initiates transcription of 
the second layer and comprises a promoter that is capable of initiating in vivo the 5* to 3' 
synthesis of RNA from cDNA (e.g., a 5' eukaryotic promoter), and may farther 
comprise other elements, including a 3* transcription termination/polyadenylation site, 

20 one or more splice sites, as well as other RNA nuclear export elements, including, for 
example, the hepatitis B virus posttranscriptional regulatory element (PRE) (Huang et 
al., Mol Cell Biol. 73:7476, 1993; Huang et al., J. Virol 65:3193, 1994; Huang et al. f 
MoL Cell. Biol, 75:3864-3869, 1995), the Mason-Pfizer monkey virus constitutive 
transport element (CTE) (Bray et al., Proc. Natl Acad Sci. USA 97:1256-1260, 1994), 

25 the HIV Rev responsive element (Malim et al., Nature 355:254-257, 1 989; Cullen et al.. 
Trends Biochem. ScL 75:346, 1991), and other similar elements, if desired. 
Representative promoters suitable for use within the present invention include both 
eukaryotic (e.g., pol I, II, or III) and prokaryotic promoters, and inducible or non- 
inducible (i.e., constitutive) promoters, such as, for example, Moloney murine leukemia 

30 virus promoters, metallothionein promoters, the glucocorticoid promoter, Drosophila 
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actin 5C distal promoter, SV40 promoter, heat shock protein 65 promoter, heat shock 
protein 70 promoter, immunoglobulin promoters, mouse polyoma virus promoter (Py), 
Rous sarcoma Virus (RSV), herpes simplex vims (HSV) promoter, BK virus and JC 
virus promoters, mouse mammary tumor virus (MMTV) promoter, alphavirus junction 
5 region, CMV promoter, Adenovirus El or VA1RNA promoters, rRNA promoters, 
tRNA methionine promoter, CaMV 35S promoter, nopaline synthetase promoter, 
tetracycline responsive promoter, and the lac promoter. 

Within yet other embodiments of the invention, inducible promoters may 
be utilized. For example, within one embodiment inducible promoters are provided 

10 which initiate the synthesis of RNA from DNA, comprising a core RNA polymerase 
promoter sequence, and an operably linked nucleic acid sequence that directs the DNA 
binding of a transcriptional activator protein, and an operably linked nucleic acid 
sequence that directs the DNA binding of a transcriptional repressor protein. Within a 
further embodiment, the nucleic acid sequence that directs the DNA binding of a 

15 transcriptional activator protein is a sequence that binds a tetracycline repressor/VP16 
transactivator fusion protein. Within yet another embodiment, the nucleic acid 
sequence that directs the DNA binding of a transcription repressor protein is a sequence 
that binds a lactose repressor / Kruppel domain fusion protein. 

The second layer comprises an autocatalytic vector construct which is 

20 capable of expressing one or more heterologous nucleotide sequences and of directing 
its own replication in a cell, either autonomously or in response to one or more factors 
(e.g: is inducible). The second layer may be of viral or non-viral origin. Within one 
embodiment of the invention, the second layer construct may be an alphavirus vector 
. construct as described above. 

25 A wide variety of vector systems may be utilized as the first layer of the 

eukaryotic layered vector initiation system, including for example, viral vector 
constructs developed from DNA viruses such as those classified in the Poxviridae, 
including for example canary pox virus or vaccinia virus (e.g., Fisher-Hoch et al., PNAS 
56:317-321, 1989; Flexner et al., Ann. N.Y. Acad Sci. 569:86-103, 1989; Flexner et al., 

30 Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330 and 5,017,487; WO 



WO 97/38087 



39 



PCT/US97/06010 



89/01973); Papoviridae such as BKV, JCV or SV40 (e.g.. Mulligan etal.. Nature 
277:108-114, 1979); Adenoviridae, such as adenovirus (e.g., Berkner, Biotechniques 
6:616-627, 1988; Rosenfeld eta!., Science 252:431-434, 1991); Parvoviridae, such as 
adeno-associated virus (e.g., Samulski etal., J. Vir. 65:3822-3828, 1989; Mendelson 
5 et al., Virol 766:154-165, 1988; PA 7/222,684); Herpesviridae, such as Herpes Simplex 
Virus (e.g.. Kit, Adv. Exp. Med Biol. 2/5:219-236, 1989); and Hepadnaviridae (e.g., 
HBV), as well as certain RNA viruses which replicate through a DNA intermediate, 
such as the Retroviridae (see, e.g., U.S. Patent No. 4,777,127, GB 2,200,651, EP 
0,345,242 and W09 1/02805; Retroviridae include leukemia in viruses such as MoMLV 
10 and immunodeficiency viruses such as HIV, e.g., Poznansky, J. Virol. 65:532-536, 
1991). 

Similarly, a wide variety of vector systems may be utilized as second 
layer of the eukaryotic layered vector initiation system, including for example, vector 
systems derived from viruses of the following families: Picornaviridae (e.g., poliovirus, 
15 rhinovirus, coxsackieviruses), Caliciviridae, Togaviridae (e.g., alphavirus, rubella), 
Flaviviridae (e.g., yellow fever, HCV), Coronaviridae (e.g., HCV, TGEV, IBV, MHV, 
BCV), Bunyaviridae, Arenaviridae, Retroviridae (e.g., RSV, MoMLV, HIV, HTLV), 
hepatitis delta virus and Astrovims. In addition, non-mammalian RNA viruses (as well 
as components derived therefrom) may also be utilized, including for example, bacterial 

20 and bacteriophage replicases, as well as components derived from plant viruses, such as 
potexviruses (e.g., PVX), carlaviruses (e.g., PVM), tobravimses (e.g., TRV, PEBV, 
PRV), Tobamoviruses (e.g., TMV, ToMV, PPMV), luteoviruses (e.g., PLRV), 
poty viruses (e.g., TEV, PPV, PVY), tombusviruses (e.g., CyRSV), nepovimses (e.g., 
GFLV), bromoviruses (e.g., BMV), and topamoviruses. 

? 25 The replication competency of the autocatalytic vector construct, 

contained within the second layer of the eukaryotic vector initiation system, may be 
measured by a variety of assays known to one of skill in the art including, for example, 
ribonuclease protection assays which measure increases of both positive-sense and 
negative-sense RNA in transfected cells over time, in the presence of an inhibitor of 

30 cellular RNA synthesis, such as dactinomycin, and also assays which measure the 
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synthesis of a subgenomic RNA or expression of a heterologous reporter gene in 
transfected cells. 

Within particularly preferred embodiments of the invention, eukaryotic 
layered vector initiation systems are provided that comprise a 5' promoter which is 
5 capable of initiating in vivo the synthesis of alphavirus RNA from cDNA (i.e., a DNA 
promoter of RNA synthesis), followed by a 5' sequence which is capable of initiating 
transcription of an alphavirus RNA, a nucleic acid sequence which operably encodes all 
four alphaviral nonstructural proteins (including a nucleic acid molecule as described 
above which, when operably incorporated into a recombinant alphavirus particle, results 

10 in the desired phenotype), an alphavirus RNA polymerase recognition sequence, and a 
3' sequence which controls transcription termination/polyadenylation. In addition, a 
viral junction region which is operably linked to a heterologous sequence to be 
expressed may be included. Within various embodiments, the viral junction region may 
be modified, such that viral transcription of the subgenomic fragment is increased or 

15 reduced, rather than inactivated. Within other embodiments, a second viral junction 
region may be inserted following the first inactivated viral junction region, the second 
viral junction region being either active or modified such that viral transcription of the 
subgenomic fragment is increased or reduced. 

Following transcription of the eukaryotic layered vector initiation 

20 system, the resulting alphavirus RNA vector replicon molecule is comprised of a 5* 
sequence which is capable of initiating transcription of an alphavirus RNA, a nucleotide 
sequence encoding biologically active alphavirus nonstructural proteins, a viral junction 
region, a heterologous nucleotide sequence, an alphavirus RNA polymerase recognition 
. sequence, and a poly adenylate sequence. 

25 Various aspects of the alphavirus cDNA vector constructs have been 

discussed above, including the 5' sequence which is capable of initiating transcription of 
an alphavirus, the nucleotide sequence encoding alphavirus nonstructural proteins, the 
viral junction region, including junction regions which have been inactivated such that 
viral transcription of the subgenomic fragment is prevented, and the alphavirus RNA 



WO 97/38087 



41 



PCT/US97/06010 



polymerase recognition sequence. In addition, modified junction regions and tandem 
junction regions have also been discussed above. 

In another embodiment of the invention, the eukaryotic layered vector 
initiation system is derived from an alphavirus vector, such as a Sindbis vector 
5 construct, which has been adapted to replicate in one or more cell lines from a particular 
eukaryotic species, especially a mammalian species, such as humans. For instance, if 
the gene encoding the recombinant protein to be expressed is of human origin and the 
protein is intended for human therapeutic use, production in a suitable human cell line 
may be preferred in order that the protein be post-translationally modified as would be 
10 expected to occur in humans. This approach may be useful in further enhancing 
recombinant protein production (as discussed in more detail below). Given the overall 
plasticity of an alphaviral genome due to the infidelity of the viral replicase, variant 
strains with an enhanced ability to establish high titer productive infection in selected 
eukaryotic cells {e.g., human, murine, canine, feline, etc.) can be isolated. Additionally, 
15 variant alphaviral strains having an enhanced ability to establish high titer persistent 
infection in eukaryotic cells may also be isolated using this approach. Alphavirus 
expression vectors can then be constructed from cDNA clones of these variant strains 
according to procedures provided herein. 

Within another embodiment of the invention, the eukaryotic layered 
20 vector initiation system comprises a promoter for initial alphaviral vector transcription 
that is transcriptionally active only in a differentiated cell type. Briefly, it is well 
established that alphaviral infection of mammalian cells in culture, such as those 
derived from hamster (e.g., baby hamster kidney cells) or chicken (e.g., chicken embryo 
fibroblasts), typically results in cytoxicity. Thus, to produce a stably transformed or 
25 transfected host cell line, the eukaryotic layered vector initiation system may be 
introduced into a host cell wherein the promoter which enables the initial vector 
amplification is a transcriptionally inactive, but inducible, promoter. In a particularly 
preferred embodiment, such a promoter is differentiation state dependent. In this 
configuration, activation of the promoter and subsequent activation of the alphavirus 
30 DNA vector coincides with induction of cell differentiation. Upon growth to a certain 
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cell number of such a stably transformed or transfected host cell line, the appropriate 
differentiation stimulus is provided, thereby initiating transcription of the vector 
construct and amplified expression of the desired gene and encoded polypeptide^). 
Many such differentiation state-dependent promoters are known to those in the art, as 
5 are cell lines which can be induced to differentiate by application of a specific stimulus. 
Representative examples include cell lines F9 and PI 9, HL60, and Freund 
erythroleukemic cell lines and HEL, which are activated by retinoic acid, horse serum, 
and DMSO, respectively. 

In a preferred embodiment, such promoters can be regulated by two 

10 separate components. For example, as described in Example 7, binding sites for both a 
transcriptional activator and a transcriptional repressor are positioned adjacent to a 
"core" promoter, in an operably-dependent manner. In this configuration, the 
uninduced state is maintained by blocking the ability of the transcriptional activator to 
find its recognition site, while allowing the transcriptional repressor to be constitutively 

15 expressed and bound to its recognition site. Induction is permitted by blocking the 
transcriptional repressor and removing the transactivator block. For example, * a 
tetracycline-responsive promoter system (Gossen and Bujard, Proc. Nati Acad. ScL 
59:5547-5551, 1992) may be utilized for inducible transcription of an alphavirus vector 
RNA. In this system, the expression of a tetracycline repressor and HSV-VP16 

20 transactivator domain, as a "fusion" protein (rTA), stimulates in vivo transcription of the 
alphavirus vector RNA by binding specifically to a tetracycline operator sequence 
(tetO) located immediately adjacent to a minimal "core" promoter (for example, CMV). 
The binding and transactivation event is reversibly blocked by the presence of 
. tetracycline, and may be "turned on" by removing tetracycline from the culture media. 

25 As uninduced basal levels of transcription will vary among different cell types, other 
different minimal core promoters (for example HSV-tk) may be linked to the 
tetracycline operator sequences, provided the transcription start site is known, to allow 
juxtaposition at or in the immediate proximity of alphavirus vector nucleotide 1 . 

The rTA transactivator can be provided by an additional expression 

30 cassette also stably transformed into the same cell line; and in certain embodiments, the 
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rTA expression cassette may itself be autoregulatory. The use of an autoregulatory rTA 
expression cassette circumvents potential toxicity problems associated with constitutive 
high level expression of rTA by linking expression to transcriptional control by the 
same tetO-Iinked promoter to which rTA itself binds. This type of system creates a 
5 negative feedback cycle that ensures very little rTA is produced in the presence of 
tetracycline, but becomes highly active when the tetracycline is removed. Such an 
autoregulatory rTA expression cassette is provided in plasmid pTet-tTAk (Shockett 
et al M Proc. Natl Acad ScL USA 92:6522-6526, 1995). 

For transcriptional repression, the KRAB repression domain of a certain 
10 zinc finger proteins can also be utilized. Briefly, KRAB (Kriippel-associated box) 
domains are highly conserved sequences present in the amino-terminal regions of more 
than one-third of all Kriippel-class Cys 2 His 2 zinc finger proteins. The domains contain 
two predicted amphipathic a-helicies and have been shown to function as DNA 
binding-dependent RNA polymerase II transcriptional repressors (for example, Licht 
15 et al., Nature 346: 76-79, 1990). Like other transcription factors, the active repression 
domain and the DNA-binding domain are distinct and separable. Therefore, *the 
repression domain can be linked as a fusion protein to any sequence specific DNA 
binding protein for targeting. Thus, the DNA binding protein component can be 
reversibly prevented from binding in a regulatable fashion, thereby turning "ofF the 
20 transcriptional silencing. For example, the KRAB domain from human Koxl (Thiesen, 
New Biol 2:363-374, 1990) can be fused to the DNA-binding lactose (lac) repressor 
protein, forming a hybrid transcriptional silencer with reversible, sequence-specific 
binding to a lac operator sequence engineered immediately adjacent to the tet- 
responsive promoter. In this configuration, constitutive expression of the lac 
25 repressor/KRAB domain fusion (rKR) will result in binding to the lac operator sequence 
and the elimination of any "leaky" basal transcription from the uninduced tet-responsive 
promoter. When vector expression is desired and tetracycline is removed from the 
system, IPTG is added to prevent rKR-mediated transcriptional silencing. 

In addition, KRAB domains from other zinc finger proteins, for example, 
30 ZNF133 (Tommerup et al., Hum. Mol Genet 2:1571-1575, 1993), ZNF91 (Bellefroid 
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ttal.,EMBOJ. 72:1363-1374, 1993), ZNF2 (Rosati etaL, Nucleic Acids Res. 79:5661- 
5667, 1991), as well as other transferable repressor domains, for example, Drosophila 
en or eve genes (Jaynes and OTarrell, EMBOJ. 70:1427-1433, 1991; Han and Manley, 
Genes Dev. 7:491-503, 1993), human zinc finger protein YY1 (Shi et al. f Cell 67:377- 
5 388, 1991), Wilms' tumor suppressor protein WT1 (Madden et al., Science 253: 1550- 
1553, 1991), thyroid hormone receptor (Baniahmad et al., EMBO J. 77:1015-1023, 
1992), retinoic acid receptor (Baniahmad et al., ibid), Kid-1 (Witzgall et al., Proc. Natl. 
Acad ScL USA 97:4514-4518, 1994), can likewise be readily used in the gene delivery 
vehicles provided herein. Furthermore, the lac repressor/lac operator component of this 
10 system may be substituted by any number of other regulatable systems derived from 
other sources, for example, the tryptophan and maltose operons, or GAL4. 

E. Recombinant Alphavirus Particles, and Generation and Use of 'Empty' 
Toeaviru s Particles or Togaviruses Particles containing non-homologous viral 
15 RNA 

Within another aspect of the present invention, the generation of 
recombinant alphavirus particles containing RNA alphavirus vectors, which are capable 
of infection of eukaryotic target cells, are described. Briefly, such recombinant 
alphavirus particles generally comprise one or more alphavirus structural proteins, a 

20 lipid envelope, and an RNA vector replicon as described herein. 

Methods for generating recombinant alphavirus vector particles may be 
readily accomplished by, for example, co-transfection of complementing vector and 
defective helper (DH) molecules derived from in vitro transcribed RNA, or, 
alternatively, plasmid DNA, or by coinfection with virus (see Xiong et al., Science 

25 243: 1 1 88- 1 1 9 1 , 1 989, Bredenbeek et al., J. Virol 67:6439-6446, 1 993, Dubensky et al., 
J. Virol 70:508-519, 1996 and Dubensky et al., W/O 95/07994). 

Within other aspects, methods for generating recombinant alphavirus 
vector particles from alphavirus-derived packaging or producer cell lines are provided. 
Briefly, such PCL and their stably transformed structural protein expression cassettes 

30 can be derived using methods described within W/O 95/07994, or using novel methods 
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described within this invention. For example, the production of recombinant alphavirus 
vector particles by PCL can be accomplished following introduction of alphavirus- 
based vector molecules with desirable properties into the PCL (see Example 6), the 
vectors being derived from in vitro transcribed RNA, plasmid DNA, or previously 
5 obtained recombinant alphavirus particles. In yet a Anther example, production of 
recombinant particles from alphavirus vector producer cell lines is described (see 
Example 7). 

Within other embodiments, methods are provided for producing high- 
titer stable togavirus capsid particles that do not contain any genomic RNA (/>., contain 
10 substantially no viral RNA) or RNA Vector Replicons. As utilized within the present 
invention, it should be understood that "substantially no" genomic or RNA Vector 
Replicon nucleic acids refers to ratios of greater than 10:1, and preferably greater than 
15:1 of 35 S methionine versus 3 H uridine incorporation into virus particles (as compared 
to wild-type) (see. e.g., Example 8 and Figure 38). For example, within one 

15 embodiment empty capsid particles (preferably with the lipid bilayter and lycoprotein 
complement) are constructed from a selected pathogenic virus from the togavirus family 
(such as an Alphavirus or Rubivirus), and used as immunogens to establish protective 
immunity against infection with the wild-type togavirus. The empty viral particles are a 
desirable immunogenic alternative, as they are unable to replicate and produce virus, yet 

20 are able to generate both cellular and humoral immune responses. Thus, utilizing the 
methods which are described in more detail in Example 8, empty capsid particles 
derived from togaviruses (with or without a lipid bilayer and glycoprotein complement) 
can be generated from a wide variety of togaviruses, including, but not limited to, 
. alphaviruses (such as Sindbis Virus (e.g., SIN-1 or wild-type Sindbis virus), 

25 Venezuelan Equine Encephalitis virus, Ross River virus, Eastern Equine Encephalitis 
virus, Western Equine Encephalitis vims, and rubiviruses (e.g., rubella), 

In a second embodiment, sequences from heterologous viruses which 
encode peptides that bind to genomic viral RNA can be inserted into a defective helper 
(DH) expression cassette in the amino terminal region of the alphavirus capsid gene, 

30 which has been deleted of the sequences which encode the region of the protein that 
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binds to the homologous alphavirus genomic RNA. For example, BHK cells can be 
electrbporated with an alphavims replicon RNA, a DH RNA containing a sequence that 
encodes a heterologous virus genomic RNA binding peptide, and a replicon derived 
from the same heterologous virus. Thus, the alphavirus particles produced contain a 
5 genomic RNA from a heterologous virus, and possess the host-range tropism of the 
alphavirus. As one possible example, gag sequences encoding proteins required for 
retrovirus RNA binding are included in the DH expression cassette construct. In this 
configuration, the resulting alphavirus particles would contain retrovirus vector RNA. 

10 F. Heterologous Sequences 

As noted above, a wide variety of nucleotide sequences may be carried 
and expressed by the gene delivery vehicles of the present invention. Preferably, the 
nucleotide sequences should be of a size sufficient to allow production of viable virus. 
Within the context of the present invention, the production of any measurable titer by 

15 recombinant alphavirus particles, for example, by plaque assay, luciferase assay, or 
p-galactosidase assay of infectious vims on appropriate susceptible monolayers, or thfc 
expression of detectable levels of the heterologous gene product by RNA or DNA 
vectors, is considered to be "production of viable vims." This may be, at a minimum, 
an alphavims vector construct which does not contain any additional heterologous 

20 sequence. However, within other embodiments, the vector construct may contain 
additional heterologous or foreign sequences. Within preferred embodiments, the 
heterologous sequence can comprise a heterologous sequence of at least about 100 
bases, 2 kb, 3.5 kb, 5 kb, 7 kb, or even a heterologous sequence of at least about 8 kb. 

As will be evident to one of ordinary skill in the art given the disclosure 

25 provided herein, the efficiency of recombinant alphavirus particle packaging and hence, 
viral titer, is to some degree dependent upon the size of the sequence to be packaged. 
Thus, in order to increase the efficiency of packaging and the production of viable vims, 
additional non-coding sequences may be added to the vector construct. Moreover, 
within certain embodiments of the invention it may be desired to increase or decrease 

30 viral titer. This increase or decrease may be accomplished by increasing or decreasing 
the size of the heterologous sequence, and hence the efficiency of packaging. 
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As briefly noted above, a wide variety of heterologous sequences may be 
included within the gene delivery vehicles described herein including, for example, 
sequences which encode palliatives such as lymphokines or cytokines, toxins, prodrug 
converting enzyme, antigens which stimulate an immune response, ribozymes, proteins 
5 for therapeutic application such as growth or regulatory factors, and proteins which 
assist or inhibit an immune response, as well as antisense sequences (or sense sequences 
for "antisense applications"). In addition, as discussed above, the gene delivery vehicles 
provided herein may contain (and express, within certain embodiments) two or more 
heterologous sequences. 

10 

1. Lymphokines 

Within one embodiment of the invention, the heterologous sequence 
encodes a lymphokine. Briefly, lymphokines act to proliferate, activate, or differentiate 
immune effectors cells. Representative examples of lymphokines include gamma 

15 interferon, tumor necrosis factor, IL-L IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, 
IL-10, IL-il, IL-12, IL-13, IL-14, IL-15, GM-CSF, CSF-1 andG-CSF. 

Within related embodiments of the invention, the heterologous sequence 
encodes an immunomodulatory cofactor. Briefly, as utilized within the context of the 
present invention, "immunomodulatory cofactor" refers to factors which, when 

20 manufactured by one or more of the cells involved in an immune response, or when 
added exogenously to the cells, causes the immune response to be different in quality or 
potency from that which would have occurred in the absence of the cofactor. The 
quality or potency of a response may be measured by a variety of assays known to one 
of skill in the art including, for example, in vitro assays which measure cellular 

25 proliferation (e.g., 3 H thymidine uptake), and in vitro cytotoxic assays (e.g., which 
measure 5, Cr release) (see Warner et al., AIDS Res. and Human Retroviruses 7:645-655, 
1991). 

Representative examples of immunomodulatory co-factors include alpha 
interferon (Finter etal., Drugs 42(5):749-765, 1991; U.S. Patent No. 4,892,743; U.S. 
30 Patent No. 4,966,843; WO 85/02862; Nagata etal.. Nature 25*316-320, 1980; 
Familletti etal., Methods in Em. 75:387-394, 1981; Twu etal., Proc. Natl. Acad Sci. 
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USA 56:2046-2050, 1989; Faktor etal., Oncogene 3:867-872, 1990), beta interferon 
(Seif etal., J. Virol. 65:664-671, 1991), gamma interferons (Radford etal., American 
Society of Hepatology:200&-2015, 1991; Watanabe etal., PNAS 56:9456-9460, 1989; 
Gansbacher etal., Cancer Research 50:7820-7825, 1990; Maio etal., Can. Immunol. 
5 Immunother. 50:34-42, 1989; U.S. Patent Nos. 4,762,791 and 4,727,138), G-CSF (U.S. 
Patent Nos. 4,999,291 and 4,810,643), GM-CSF (WO 85/04188), TNFs (Jayaraman 
etal., J. Immunology 74*942-951, 1990), Interleukin- 2 (IL-2) (Karupiah etal., J. 
Immunology 144:290-298, 1990; Weber etal,. J. Exp. Med. 166: 1716-1733, 1987; 
Gansbacher etal., J Exp. Med. 772:1217-1224, 1990; U.S. Patent No. 4,738,927), IL-4 

10 (Tepper etal., Cell 57:503-512. 1989; Golumbek etal., Science 254:713-716, 1991; 
U.S. Patent No. 5,017,691), IL-6 (Brakenhof et al.. J. Immunol. 759:4116-4121, 1987; 
WO 90/06370), IL-12, IL-15 (Grabstein etal., Science 264:965-968, 1994; Genbank- 
EMBL Accession No. V03099), ICAM-1 (Altman etal., Nature 555:512-514, 1989), 
ICAM-2, LFA-1, LFA-3, MHC class I molecules, MHC class II molecules, 

15 2 -microglobulin, chaperones, CD3, B7/BB1, MHC linked transporter proteins or 
analogues thereof. 

The choice of which immunomodulatory cofactor to include within a 
alphavirus vector construct may be based upon known therapeutic effects of the 
cofactor, or experimentally determined. For example, in chronic hepatitis B infections 

20 alpha interferon has been found to be efficacious in compensating a patient's 
immunological deficit and thereby assisting recovery from the disease. Alternatively, a 
suitable immunomodulatory cofactor may be experimentally determined. Briefly, blood 
samples are first taken from patients with a hepatic disease. Peripheral blood 
- lymphocytes (PBLs) are restimulated in vitro with autologous or HLA-matched cells 

25 (e.g., EBV transformed cells), and transduced with an alphavirus vector construct which 
directs the expression of an immunogenic portion of a hepatitis antigen and the 
immunomodulatory cofactor. Stimulated PBLs are used as effectors in a CTL assay 
with the HLA-matched transduced cells as targets. An increase in CTL response over 
that seen in the same assay performed using HLA-matched stimulator and target cells 

30 transduced with a vector encoding the antigen alone, indicates a useful 
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immunomodulatory cofactor. Within one embodiment of the invention, the 
immunomodulatory cofactor gamma interferon is particularly preferred. 

Another example of an immunomodulatory cofactor is the B7/BB1 
costimulatory factor. Briefly, activation of the full functional activity of T cells requires 
5 two signals. One signal is provided by interaction of the antigen-specific T cell receptor 
with peptides which are bound to major histocompatibility complex (MHC) molecules, 
and the second signal, referred to as costimulation, is delivered to the T cell by antigen- 
presenting cells. The second signal is required for interleukin-2 (IL-2) production by 
T cells and appears to involve interaction of the B7/BB1 molecule on antigen- 

10 presenting cells with CD28 and CTLA-4 receptors on T lymphocytes (Linsley et al., J. 
Exp Med 773:721-730, 1991a, and J. Exp. Med 774:561-570, 1991). Within one 
embodiment of the invention, B7/BBI may be introduced into tumor cells in order to 
cause costimulation of CD8 + T cells, such that the CD8 + 1 cells produce enough IL-2 to 
expand and become fully activated. These CD8 + T cells can kill tumor cells that are not 

15 expressing B7 because costimulation is no longer required for further CTL function. 
Vectors that express both the costimulatory B7/BB1 factor and, for example,* an 
immunogenic HBV core protein, may be made utilizing methods which are described 
herein. Cells transduced with these vectors will become more effective antigen- 
presenting cells. The HBV core-specific CTL response will be augmented from the 

20 fully activated CD8* T ceil via the costimulatory Iigand B7/BB 1 . 

2. Toxins 

Within another embodiment of the invention, the heterologous sequence 
encodes a toxin. Briefly, toxins act to directly inhibit the growth of a cell. 

25 Representative examples of toxins include ricin (Lamb et al., Eur. J. Biochem. 148:265- 
270, 1985), abrin (Wood etal., Eur. J. Biochem. 795:723-732, 1991; Evensen etal., 
J. of Biol. Chem. 266:6848-6852, 1991; Collins et al., J. of Biol. Chem. 265:8665-8669, 
1990; Chen etal., Fed. of Eur. Biochem Soc. 509:115-118, 1992), diphtheria toxin 
(Tweten et al., J. BioL Chem. 260:10392-10394, 1985), cholera toxin (Mekalanos et al., 

30 Nature J06:S51-557, 1983; Sanchez and Holmgren, PNAS 56:481-485, 1989), gelonin 
(Stirpe etal., J. Biol. Chem. 255:6947-6953, 1980), pokeweed (Irvin, Pharmac Ther. 
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27:371-387, 1983), antiviral protein (Barbieri etal., Biochem. J. 203:55-59, 1982; Irvin 
et ah; Arch. Biochem. & Biophys. 200:418-425, 1980; Irvin, Arch. Biochem. & Biophys. 
769:522-528, 1975), tritin, Shigella toxin (Calderwood etal., PNAS £4:4364-4368, 
1987; Jackson et al., Microb. Path. 2:147-153, 1987), Pseudomonas exotoxin A (Carroll 
5 and Collier, J. Biol Chem. 262:8707-8711, 1987), herpes simplex vims thymidine 
kinase (HSVTK) (Field etal., 1 Gen. Virol. 49:115-124, 1980), and £ coli. guanine 
phosphoribosyl transferase. 

3. Prodrug converting enzymes 

10 Within other embodiments of the invention, the heterologous sequence 

encodes a prodrug converting enzyme. Briefly, as utilized within the context of the 
present invention, a prodrug converting enzyme refers to a gene product that activates a 
compound with little or no cytotoxicity into a toxic product (the prodrug). 
Representative examples of such gene products include HSVTK and VZVTK (as well 

15 as analogues and derivatives thereof), which selectively monophosphorylate certain 
purine arabinosides and substituted pyrimidine compounds, converting them to 
cytotoxic or cytostatic metabolites. More specifically, exposure of the drugs 
ganciclovir, acyclovir, or any of their analogues (e.g., FIAU, FIAC DHPG) to HSVTK 
phosphorylates the drug into its corresponding active nucleotide triphosphate form. 

20 Representative examples of other prodrug converting enzymes which can 

also be utilized within the context of the present invention include: E. coli guanine 
phosphoribosyl transferase which converts thioxanthine into toxic thioxanthine 
monophosphate (Besnard et al., Mol Cell Biol 7:4139-4141, 1987); alkaline 
^ phosphatase, which converts inactive phosphorylated compounds such as mitomycin 

25 phosphate and doxorubicin-phosphate into toxic dephosphorylated compounds; fungal 
(e.g., Fusarium oxysporum) or bacterial cytosine deaminase, which converts 5- 
fluorocytosine to the toxic compound 5-fluorouracil (Mullen, PNAS 59:33, 1992); 
carboxypeptidase G2, which cleaves the glutamic acid from para-N-bis (2-chloroethyI) 
aminobenzoyl glutamic acid, thereby creating a toxic benzoic acid mustard; and 

30 Penicillin-V amidase, which converts phenoxyacetabide derivatives of doxorubicin and 
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melphalan to toxic compounds (see generally, Vrudhula et ah, J. of Med Chem. 
36(7):9 19-923, 1993; Kern et al., Carte. Immun, Immunother. J/(4):202-206, 1990). 

4. Antisense Sequences 
5 Within another embodiment of the invention, the heterologous sequence 

is an antisense sequence. Briefly, antisense sequences are designed to bind to RNA 
transcripts, and thereby prevent cellular synthesis of a particular protein or prevent use 
of that RNA sequence by the cell. Representative examples of such sequences include 
antisense thymidine kinase, antisense dihydrofolate reductase (Maher and Dolnick, 
10 Arch Biochem. & Biophys. 255:214-220, 1987; Bzik et al., / > M4S<W:8360-8364, 1987), 
antisense HER2 (Coussens etal., Science 250:1 132-1139, 1985), antisense ABL 
(Fainstein etal., Oncogene 4:1477-1481, 1989), antisense Myc (Stanton etal., Nature 
J/0:423-425, 1984) and antisense ras, as well as antisense sequences which block any 
of the cell cycle signaling components (e.g., cyclins, cyclin-dependent kinases, cyclin- 
1 5 dependent kinase inhibitors) or enzymes in the nucleotide biosynthetic pathway. In 
addition, within other embodiments of the invention antisense sequences to interferon 
and 2 microglobulin may be utilized in order to decrease immune response. 

In addition, within a further embodiment of the invention, antisense 
RNA may be utilized as an anti-tumor agent in order to induce a potent Class I 
20 restricted response. Briefly, in addition to binding RNA and thereby preventing 
translation of a specific mRNA, high levels of specific antisense sequences are believed 
to induce the increased expression of interferons (including gamma-interferon) due to 
the formation of large quantities of double-stranded RNA. The increased expression of 
gamma interferon, in turn, boosts the expression of MHC Class I antigens. Preferred 
25 antisense sequences for use in this regard include actin RNA, myosin RNA, and histone 
RNA. Antisense RNA which forms a mismatch with actin RNA is particularly 
preferred. 



30 



5. Ribozvmes 

Within other aspects of the present invention, gene delivery vehicles are 
provided which produce ribozymes upon infection of a host cell. Briefly, ribozymes are 
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used to cleave specific RNAs and are designed such that it can only affect one specific 
RNA sequence. Generally, the substrate binding sequence of a ribozyme is between 10 
and 20 nucleotides long. The length of this sequence is sufficient to allow a 
hybridization with target RNA and disassociation of the ribozyme from the cleaved 
5 RNA. 

A wide variety of ribozymes may be utilized within the context of the 
present invention, including for example, Group 1 intron ribozymes (Cech et al., U.S. 
Patent No. 4,987,071); hairpin ribozymes (Hampel et aL Nucl Acids Res 75:299-304, 
1990, U.S. Patent No. 5,254,678 and European Patent Publication No. 0 360 257), 

10 hammerhead ribozymes (Rossi, J.J. etal., Pharmac. Ther. 50:245-254, 1991; Forster 
and Symons, Cell 48:21 1-220, 1987; Haseloff and Gerlach, Nature J25:596-600, 1988; 
Walbot and Bruening, Nature 334:196, 1988; Haseloff and Gerlach, Nature 334:5%5 y 
1988), hepatitis delta virus ribozymes (Perrotta and Been, Biochem. 31: 16, 1992); 
RNase P ribozymes (Takada et al., Cell 35:849, 1983); as well as other types of 

15 ribozymes (see e.g., WO 95/29241, and WO 95/31551). Further examples of ribozymes 
include those described in U.S. Patent Nos. 5,1 16,742, 5,225,337 and 5,246,921. 

6. Proteins and other cellular constituents 

Within other aspects of the present invention, a wide variety of proteins 
20 or other cellular constituents may be carried and/or expressed by the gene delivery 
vehicles provided herein. Representative examples of such proteins include native or 
altered cellular components, as well as foreign proteins or cellular constituents, found in 
for example, viruses, bacteria, parasites or fungus. 

25 a. Altered Cellular Components 

Within one embodiment, gene delivery vehicles are provided which 
direct the expression of an immunogenic, non-tumorigenic, altered cellular component 
{see, e.g., WO 93/10814). As utilized herein, the term "immunogenic" refers to altered 
cellular components which are capable, under the appropriate conditions, of causing an 

30 immune response. This response must be cell-mediated, and may also include a 
humoral response. The term ,, non-tumorigenic ,, refers to altered cellular components 
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which will not cause cellular transformation or induce tumor formation in nude mice. 
The phrase "altered cellular component" refers to proteins and other cellular 
constituents which are either associated with rendering a cell tumorigenic, or are 
associated with tumorigenic cells in general, but are not required or essential for 
5 rendering the cell tumorigenic. 

Briefly, before alteration, the cellular components may be essential to 
normal cell growth and regulation and include, for example, proteins which regulate 
intracellular protein degradation, transcriptional regulation, cell-cycle control, and cell- 
cell interaction. After alteration, the cellular components no longer perform their 
10 regulatory functions and, hence, the cell may experience uncontrolled growth. 
Representative examples of altered cellular components include ras\ p53\ Rb\ altered 
protein encoded by the Wilms' tumor gene, ubiquitin*, mucin", protein encoded by the 
DCC, APC, and MCC genes, the breast cancer gene BRCAf, as well as receptors or 
receptor-like structures such as neu, thyroid hormone receptor, platelet derived growth 
1 5 factor (PDGF) receptor, insulin receptor, epidermal growth factor (EGF) receptor, and 
the colony stimulating factor (CSF) receptor. 

Once a sequence encoding the altered cellular component has been 
obtained, it is necessary to ensure that the sequence encodes a non-tumorigenic protein. 
Various assays which assess the tumorigenicity of a particular cellular component are 
20 known and may easily be accomplished. Representative assays include a rat fibroblast 
assay, tumor formation in nude mice or rats, colony formation in soft agar, and 
preparation of transgenic animals, such as transgenic mice. 

Tumor formation in nude mice or rats is a particularly important and 
fc sensitive method for determining the tumorigenicity of a particular cellular component. 
25 Nude mice lack a functional cellular immune system (i.e., do not possess CTLs), and 
therefore provide a useful in vivo model in which to test the tumorigenic potential of 
cells. Normal non-tumorigenic cells do not display uncontrolled growth properties if 
infected into nude mice. However, transformed cells will rapidly proliferate and 
generate tumors in nude mice. Briefly, in one embodiment an alphavirus vector 
30 construct is administered to syngeneic murine cells, followed by injection into nude 
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mice. The mice are visually examined for a period of 2 to 8 weeks after injection in 
order to determine tumor growth. The mice may aJso be sacrificed and autopsied in 
order to determine whether tumors are present. (Giovanella et al., J. Natl. Cancer Inst. 
45:1531-1533, 1972; Furesz et al., Abnormal Cells, New Products and Risk, Hopps and 
5 Petricciani (eds.), Tissue Culture Association, 1985; and Levenbook et al., J. Biol Std 
7J:135-141, 1985.) 

Tumorigenicity may also be assessed by visualizing colony formation in 
soft agar (Macpherson and Montagnier, Virol 23:291-294, 1964). Briefly, one property 
of normal non-tumorigenic cells is "contact inhibition" (i.e., cells will stop proliferating 

10 when they touch neighboring cells). If cells are plated in a semi-solid agar support 
medium, norma! cells rapidly become contact inhibited and stop proliferating, whereas 
tumorigenic cells will continue to proliferate and form colonies in soft agar. 

If the altered cellular component is associated with making the cell 
tumorigenic, then it is necessary to make the altered cellular component non- 

1 5 tumorigenic. For example, within one embodiment the sequence or gene of interest 
which encodes the altered cellular component is truncated in order to render the gene 
product non-tumorigenic. The gene encoding the altered cellular component may be 
truncated to a variety of sizes, although it is preferable to retain as much as possible of 
the altered cellular component. In addition, it is necessary that any truncation leave 

20 intact at least some of the immunogenic sequences of the altered cellular component. 
Alternatively, multiple translational termination codons may be introduced downstream 
of the immunogenic region. Insertion of termination codons will prematurely terminate 
protein expression, thus preventing expression of the transforming portion of the 
' protein. 

25 As noted above, in order to generate an appropriate immune response, 

the altered cellular component must also be immunogenic. Immunogenicity of a 
particular sequence is often difficult to predict, although T cell epitopes often possess an 
immunogenic amphipathic alpha-helix component. In general, however, it is preferable 
to determine immunogenicity in an assay. Representative assays include an ELISA, 

30 which detects the presence of antibodies against the newly introduced vector, as well as 
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assays which test for T helper cells such as gamma-interferon assays, IL-2 production 
assays, and proliferation assays. 

As noted above, within another aspect of the present invention, several 
different altered cellular components may be co-expressed in order to form a general 
5 anti-cancer therapeutic. Generally, it will be evident to one of ordinary skill in the art 
that a variety of combinations can be made. Within preferred embodiments, this 
therapeutic may be targeted to a particular type of cancer. For example, nearly all colon 
cancers possess mutations in ras, p53, DCC APC or MCC genes. An alphavirus vector 
construct which co-expresses a number of these altered cellular components may be 
1 0 administered to a patient with colon cancer in order to treat all possible mutations. This 
methodology may also be utilized to treat other cancers. Thus, an alphavirus vector 
construct which co-expresses mucin*, ras\ neu, BRCAf and p53' may be utilized to 
treat breast cancer. 

15 b. Antigens from foreign organisms or other pathogens 

Within other aspects of the present invention, vectors are provided whfch 
direct the expression of immunogenic portions of antigens from foreign organisms or 
other pathogens. Representative examples of such antigens include bacterial antigens 
(e.g.. E. coli, streptococcal, staphylococcal, mycobacterial, etc.), fungal antigens, 

20 parasitic antigens, and viral antigens (e.g., influenza virus, Feline Leukemia Virus 
("FeLV"), immunodeficiency viruses such as Feline Immunodeficiency Virus ("FIV") 
or Human Immmunodeficiency Virus ("HIV"), Hepatitis A, B and C Virus ("HAV", 
"HBV M and "HCV", respectively), Human Papiloma Virus ("HPV"), Epstein-Barr Virus 
* ("EBV"), Herpes Simplex Virus ("HSV"), Hantavirus, TTLV I, HTLV II and 

25 Cytomegalovirus ("CMV"). As utilized within the context of the present invention, 
"immunogenic portion" refers to a portion of the respective antigen which is capable, 
under the appropriate conditions, of causing an immune response (i.e., cell-mediated or 
humoral). "Portions" may be of variable size, but are preferably at least 9 amino acids 
long, and may include the entire antigen. Cell-mediated immune responses may be 

30 mediated through Major Histocompatability Complex ("MHC") class I presentation, 
MHC Class II presentation, or both. 



WO 97/38087 



56 



PCT/US97/06010 



Within one aspect of the invention, alphavirus vector constructs are 
provided which direct the expression of immunogenic portions of Hepatitis B antigens 
(see, e.g., WO 93/15207). Briefly, the Hepatitis B genome is comprised of circular 
DNA of about 3.2 kilobases in length and has been well characterized (Tiollais et al., 
5 Science 21 3:406-4 1 1 , 1 98 1 ; Tiollais et al., Nature 31 7:489-495, 1 985; and Ganem and 
Varm\is,Ann. Rev. Biochem. 56:65 1-693. 1987; see also EP 0 278,940, EP 0 241.021, 
WO 88/10301, and U.S. Patent Nos. 4,696,898 and 5,024,938, which are hereby 
incorporated by reference). The Hepatitis B virus presents several different antigens, 
including among others, three HB "S" antigens (HBsAgs), an HBc antigen (HBcAg), an 
10 HBe antigen (HBeAg), and an HBx antigen (HBxAg) [see Blum et al., TIG 5(5): 154- 
158, 1989). Briefly, the HBeAg results from proteolytic cleavage of a P22 pre-core 
intermediate and is secreted from the cell. HBeAg is found in serum as a 1 7 kD protein. 
The HBcAg is a protein of 183 amino acids, and the HBxAg is a protein of 145 to 154 
amino acids, depending on subtype. 

15 The HBsAgs (designated "large," "middle" and "small") are encoded by 

three regions of the Hepatitis B genome: S, pre-S2 and pre-Sl. The large protein, 
which has a length varying from 389 to 400 amino acids, is encoded by pre-Sl, pre-S2, 
and S regions, and is found in glycosylated and non-glycosylated forms. The middle 
protein is 281 amino acids long and is encoded by the pre-S2 and S regions. The small 

20 protein is 226 amino acids long and is encoded by the S region. It exists in two forms, 
glycosylated (GP 27 s ) and non-glycosylated (P24 1 ). If each of these regions are 
expressed separately, the pre-Sl region will code for a protein of approximately 119 
amino acids, the pre-S2 region will code for a protein of approximately 55 amino acids, 
* and the S region will code for a protein of approximately 226 amino acids. 

25 As wil1 be evident to one of ordinary skill in the art, various 

immunogenic portions of the above-described S antigens may be combined in order to 
induce an immune response when administered by one of the alphavirus vector 
constructs described herein. In addition, due to the large immunological variability that 
is found in different geographic regions for the S open reading frame of HBV, particular 
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combinations of antigens may be preferred for administration in particular geographic 
regions. 

Also presented by HBV are pol ("HBV poI"\ ORF 5, and ORF 6 
antigens. Briefly, the polymerase open reading frame of HBV encodes reverse 
5 transcriptase activity found in virions and core-like particles in infected livers. The 
polymerase protein consists of at least two domains: the amino terminal domain which 
encodes the protein that primes reverse transcription, and the carboxyl terminal domain 
which encodes reverse transcriptase and RNase H activity. Immunogenic portions of 
HBV pol may be determined utilizing methods described herein, utilizing alphavirus 

1 0 vector constructs described below, and administered in order to generate an immune 
response within a warm-blooded animal. Similarly, other HBV antigens, such as ORF 
5 and ORF 6 (Miller et al., Hepatology 9:322-327, 1989) may be expressed utilizing 
alphavirus vector constructs as described herein. 

As noted above, at least one immunogenic portion of a hepatitis B 

15 antigen is incorporated into a gene delivery vehicle. The immunogenic portion(s) 
which are incorporated into the gene delivery vehicles may be of varying length, 
although it is generally preferred that the portions be at least 9 amino acids long and 
may include the entire antigen. Immunogenicity of a particular sequence is often 
difficult to predict, although T cell epitopes may be predicted utilizing computer 

20 algorithms such as TS1TES (Medlmmune, Maryland), in order to scan coding regions 
for potential T-helper sites and CTL sites. From this analysis, peptides are synthesized 
and used as targets in an in vitro cytotoxic assay. Other assays, however, may also be 
utilized, including, for example, ELISA, which detects the presence of antibodies 
- against the newly introduced vector, as well as assays which test for T helper cells, such 

25 as gamma-interferon assays, IL-2 production assays and proliferation assays. 

Immunogenic portions may also be selected by other methods. For 
example, the HLA A2.1 transgenic mouse has been shown to be useful as a model for 
human T-cell recognition of viral antigens. Briefly, in the influenza and hepatitis B 
viral systems, the murine T cell receptor repertoire recognizes the same antigenic 

30 determinants recognized by human T cells. In both systems, the CTL response 
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generated in the HLA A2.1 transgenic, mouse is directed toward virtually the same 
epitope as those recognized by human CTLs of the HLA A2.1 haplotype (Vitiello et aL 
J. Exp. Med. 775:1007-1015, 1991; Vitiello et al., Abstract of Molecular Biology of 
Hepatitis B Virus Symposia, 1992). Particularly preferred immunogenic portions for 
5 incorporation into alphavirus vector constructs include HBeAg, HBcAg and HBsAgs. 
Additional immunogenic portions of the hepatitis B virus may be obtained by truncating 
the coding sequence at various locations including, for example, the following sites: 
Bst UI, Ssp I. Ppu Ml, and Msp I (Valenzuela etal., Nature 250:815-19, 1979; 
Valenzuela etal., Animal Virus Genetics: 1CN/UCLA Symp. Mol. Cell Biol., 1980, B. 
1 0 N. Fields and R. Jaenisch (eds.), pp. 57-70, New York: Academic). 

As noted above, more than one immunogenic portion may be 
incorporated into the gene delivery vehicles. For example, a gene delivery vehicle may 
express (either separately or as one construct) all or immunogenic portions of HBcAg, 
HBeAg, HBsAgs, HBxAg, as well as immunogenic portions of HCV antigens. 

15 

1- Sources for Heterologous Sequences 

Sequences which encode the above-described proteins may be readily 
obtained from a variety of sources, including for example, depositories such as the 
American Type Culture Collection (ATCC, Rockville. MD), or from commercial 

20 sources such as British Bio-Technology Limited (Cowley, Oxford, England). 
Representative examples include BBG 12 (containing the GM-CSF gene coding for the 
mature protein of 127 amino acids); BBG 6 (which contains sequences encoding 
gamma interferon), ATCC No. 39656 (which contains sequences encoding TNF), 
- ATCC No. 20663 (which contain sequences encoding alpha interferon), ATCC Nos. 

25 31902, 31902 and 39517 (which contains sequences encoding beta interferon), ATCC 
No 67024 (which contain a sequence which encodes Interleukin-lb); ATCC Nos. 
39405, 39452, 39516, 39626 and 39673 (which contains sequences encoding 
Interleukin-2); ATCC Nos. 59399, 59398, and 67326 (which contain sequences 
encoding Interleukin-3); ATCC No. 57592 (which contains sequences encoding 

30 Interleukin-4), ATCC Nos. 59394 and 59395 (which contain sequences encoding 
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Interleukin-5), and ATCC No. 67153 (which contains sequences encoding Interleukin- 
6). 

Sequences which encode altered cellular components as described above 
may be readily obtained from a variety of sources. For example, plasmids which 
5 contain sequences that encode altered cellular products may be obtained from a 
depository such as the American Type Culture Collection (ATCC, Rockville, MD), or 
from commercial sources such as Advanced Biotechnologies (Columbia, Maryland). 
Representati ve examples of plasmids containing some of the above-described sequences 
include ATCC No. 41000 (containing a G to T mutation in the 12th codon of ras), and 

1 0 ATCC No. 4 1 049 (containing a G to A mutation in the 1 2th codon). 

Alternatively, plasmids which encode normal cellular components may 
also be obtained from depositories such as the ATCC (see, for example, ATCC No. 
41001, which contains a sequence which encodes the normal ras protein; ATCC No. 
57103, which encodes abl; and ATCC Nos. 59120 or 59121, which encode the bcr 

15 locus) and mutated to form the altered cellular component. Methods for mutagenizing 
particular sites may readily be accomplished using methods known in the art (see 
Sambrook et al., supra., 15.3 et seq.). In particular, point mutations of normal cellular 
components such as ras may readily be accomplished by site-directed mutagenesis of 
the particular codon, for example, codons 1 2, 1 3 or 61 . 

20 Sequences which encode the above-described viral antigens may 

likewise be obtained from a variety of sources. For example, molecularly cloned 
genomes which encode the hepatitis B virus may be obtained from sources such as the 
American Type Culture Collection (ATCC, Rockville, MD). For example, ATCC No. 
- 45020 contains the total genomic DNA of hepatitis B (extracted from purified Dane 

25 particles) (see Figure 3 of Blum etal., TIG 5(5): 154- 158, 1989) in the Bam HI site of 
pBR322 (Moriarty et al., Proa Natl Acad. Sci USA 75:2606-2610, 1981). 

Alternatively, cDNA sequences which encode the above-described 
heterologous sequences may be obtained from cells which express or contain the 
sequences. Briefly, within one embodiment, mRNA from a cell which expresses the 

30 gene of interest is reverse transcribed with reverse transcriptase using oligonucleotide 
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dT or random primers. The single stranded cDNA may then be amplified by PCR (see 
U.S. Patent Nos. 4,683,202; 4,683,195 and 4,800,159. See also PCR Technology. 
Principles and Applications for DNA Amplification, Erlich (ed.), Stockton Press, 1989) 
utilizing oligonucleotide primers complementary to sequences on either side of desired 
5 sequences. In particular, a double-stranded DNA is denatured by heating in the 
presence of heat stable Taq polymerase, sequence-specific DNA primers, dATP, dCTP, 
dGTP and dTTP. Double-stranded DNA is produced when synthesis is complete. This 
cycle may be repeated many times, resulting in a factorial amplification of the desired 
DNA. 

0 Sequences which encode the above-described proteins may also be 

synthesized, for example, on an Applied Biosystems Inc. DNA synthesizer (e.g., APB 
DNA synthesizer model 392 (Foster City, CA)). 

G- Alphavirus Packaging / Producer Cell Lines 

5 Within further aspects of the invention, alphavirus packaging and 

producer cell lines are provided. In particular, within one aspect of the present 
invention, alphavirus packaging cell lines are provided wherein the viral structural 
proteins are supplied in trans from one or more stably transformed expression vectors, 
and are able to encapsidate transfected, transduced, or intracellularly produced vector 

) RNA transcripts in the cytoplasm and release infectious packaged vector particles 
through the cell membrane. In preferred embodiments, the structural proteins necessary 
for packaging are synthesized at high levels only after induction by the RNA vector 
replicon itself or some other provided stimulus, and the transcripts encoding these 
structural proteins are capable of cytoplasmic amplification in a manner that will allow 

> expression levels sufficient to mimic that of a natural viral infection. Furthermore, in 
other embodiments, expression of a selectable marker is operably linked to the 
structural protein expression cassette. Such a linked selectable marker allows efficient 
generation of functional, stably transformed PCL. 

For example, alphavirus RNA vector replicon molecules of the desired 

) phenotype to be packaged, which are themselves capable of autocatalytic replication in 
the cell cytoplasm, can be introduced into the packaging cells as in vitro transcribed 
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RNA, recombinant alphavims particles, or as alphavirus cDNA vector constructs. The 
RNA vector transcripts then replicate to high levels, stimulate amplification of the 
structural protein gene transcripts) and subsequent protein expression, and are 
subsequently packaged by the viral structural proteins, yielding infectious vector 
5 particles. The intracellular expression of alphavirus proteins and/or vector RNA above 
certain levels may result in cytotoxic effects in packaging or producer cell lines. 
Therefore, within certain embodiments of the invention, it may be desirable for these 
elements to be derived from virus variants selected for reduced cytotoxicity of their 
expressed structural proteins, reduced inhibition of host macromolecular synthesis, 
1 0 and/or the ability to establish persistent infection. 

To optimize vector packaging cell line performance and final vector titer, 
successive cycles of gene transfer and vector packaging may be performed. For 
example, supematants containing infectious packaged vector particles derived from 
vector transfection of the packaging cell lines, can be used to infect or "transduce" a 
1 5 fresh monolayer of alphavirus packaging cells. Successive transductions with packaged 
vector particles and fresh packaging cells may be preferred over nucleic acid 
transfection because of its higher RNA transfer efficiency into cells, optimized 
biological placement of the vector in the cell, and ability to "scale-up" the process for 
vector production from increasingly larger numbers of packaging cells. This leads to 
20 higher expression and higher titer of packaged infectious recombinant alphavirus vector. 

Within other aspects of the invention, a stably integrated or episomally 
maintained DNA expression vector can be used to produce the alphavirus vector RNA 
molecule within the cell. Briefly, such a DNA expression vector can be configured, in 
- preferred embodiments, to be inducible, such that trancription of the alphavirus vector 
25 RNA occurs only when cells have been propagated to a desired density, and are 
subsequently induced. Once transcribed, the alphavirus vector maintains the ability to 
self-replicate autocatalytically and triggers a cascade of events that culminate in 
packaged vector particle production. This approach allows for continued vector 
expression over extended periods of culturing because the integrated DNA vector 
30 expression system is maintained through a drug or other selection marker and the DNA 
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system, once induced, will constitutively express unaltered RNA vector replicons which 
cannot be diluted out by defective RNA copies. Production of larger-scale, high titer 
packaged alphavirus vector is possible in this alphavirus "producer cell line" 
configuration, the DNA-based alphavirus vector is introduced initially into the 
packaging cell line by transfection, since size restrictions could prevent packaging of 
the expression vector into a viral vector particle for transduction. 

H. Pharmaceutical Compositions 

As noted above, the present invention also provides pharmaceutical 
compositions comprising the gene delivery vehicles described herein in combination 
with a pharmaceutically acceptable carrier, diluent, or recipient. For example, within 
one embodiment, RNA or DNA vector constructs of the present invention can be 
lyophilized for long term storage and transport, and may be reconstituted prior to 
administration using a variety of substances, but are preferably reconstituted using 
water. In certain instances, dilute salt solutions which bring the final formulation to 
isotonicity may also be used. In addition, it may be advantageous to use aqueous 
solutions containing components which enhance the activity or physically protect the 
reconstituted nucleic acid preparation. Such components include cytokines, such as IL- 
2, polycations, such as protamine sulfate, lipid formulations, or other components. 
Lyophilized or dehydrated recombinant vectors may be reconstituted with any 
convenient volume of water or the reconstituting agents noted above that allow 
substantial, and preferably total solubilization of the lyophilized or dehydrated sample. 

Recombinant alphavirus particles or infectious recombinant virus (both 
-referred to as virus below) may be preserved either in crude or purified forms. In order 
to produce virus in a crude form, producing cells may first be cultivated in a bioreactor 
or flat stock culture, wherein viral particles are released from the cells into the culture 
media. Virus may then be preserved in crude form by first adding a sufficient amount 
of a formulation buffer to the culture media containing the recombinant virus to form an 
aqueous suspension. Within certain preferred embodiments, the formulation buffer is 
an aqueous solution that contains a saccharide, a high molecular weight structural 
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additive, and a buffering component in water. The aqueous solution may also contain 
one or more amino acids. 

The recombinant virus can also be preserved in a purified form. More 
specifically, prior to the addition of the formulation buffer, the crude recombinant virus 
5 described above may be clarified by passing it through a filter and then concentrated, 
such as by a cross flow concentrating system (Filtron Technology Corp., Nortborough, 
MA). Within one embodiment, DNase is added to the concentrate to digest exogenous 
DNA. The digest is then diafiltrated in order to remove excess media components and 
to establish the recombinant virus in a more desirable buffered solution. The diafiltrate 
10 is then passed over a Sephadex S-500 gel column and a purified recombinant virus is 
eluted. A sufficient amount of formulation buffer is then added to this eluate in order to 
reach a desired final concentration of the constituents and to minimally dilute the 
recombinant virus. The aqueous suspension may then be stored, preferably at -70°C, or 
immediately dried. As above, the formulation buffer may be an aqueous solution that 
15 contains a saccharide, a high molecular weight structural additive, and a buffering 
component in water. The aqueous solution may also contain one or more amino acids* 

Crude recombinant virus may also be purified by ion exchange column 
chromatography. Briefly, crude recombinant virus may be clarified by first passing it 
through a filter, followed by loading the filtrate onto a column containing a highly 
20 sulfonated cellulose matrix. The recombinant virus may then be eluted from the column 
in purified form by using a high salt buffer, and the high salt buffer exchanged for a 
more desirable buffer by passing the eluate over a molecular exclusion column. A 
sufficient amount of formulation buffer is then added, as discussed above, to the 
. purified recombinant virus and the aqueous suspension is either dried immediately or 
25 stored, preferably at -70°C. 

The aqueous suspension in crude or purified form can be dried by 
lyophilization or evaporation at ambient temperature. Briefly, lyophilization involves 
the steps of cooling the aqueous suspension below the gas transition temperature or 
below the eutectic point temperature of the aqueous suspension, and removing water 
30 from the cooled suspension by sublimation to form a lyophilized virus. Within one 
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embodiment, aliquots of the formulated recombinant virus are placed into an Edwards 
Refrigerated Chamber (3 shelf RC3S unit) attached to a freeze dryer (Supermoduiyo 
12K). A multistep freeze drying procedure as described by Phillips et al. (Cryobiology 
75:414, 1981) is used to lyophilize the formulated recombinant virus, preferably from a 

5 temperature of -40°C to -45°C. The resulting composition contains less than 10% water 
by weight of the lyophilized virus. Once lyophilized, the recombinant virus is stable 
and may be stored at -20°C to 25°C, as discussed in more detail below. 

Within the evaporative method; water is removed from the aqueous 
suspension at ambient temperature by evaporation. Within one embodiment, water is 

0 removed through spray-drying (EP 520,748). Within the spray-drying process, the 
aqueous suspension is delivered into a flow of preheated gas ? usually air, whereupon 
water rapidly evaporates from droplets of the suspension. Spray-drying apparatus are 
available from a number of manufacturers (e.g., Drytec, Ltd., Tonbridge, England; Lab- 
Plant, Ltd., Huddersfield, England). Once dehydrated, the recombinant virus is stable 

5 and may be stored at -20°C to 25°C. Within the methods described herein, the resulting 
moisture content of the dried or lyophilized virus may be determined through use of a 
Karl-Fischer apparatus (EM Science Aquastar* VI B volumetric titrator, Cherry Hill, 
NJ), or through a gravimetric method. 

The aqueous solutions used for formulation, as previously described, are 

0 preferably composed of a saccharide, high molecular weight structural additive, a 
buffering component, and water. The solution may also include one or more amino 
acids. The combination of these components act to preserve the activity of the 
recombinant virus upon freezing and lyophilization or drying through evaporation. 
' Although one saccharide that can be utilized is lactose, other saccharides may likewise 

5 be utilized including, for example, sucrose, mannitol, glucose, trehalose, inositol, 
fructose, maltose or galactose. In addition, combinations of saccharides can be used, for 
example, lactose and mannitol, or sucrose and mannitol. A particularly preferred 
concentration of lactose is 3%-4% by weight. Preferably, the concentration of the 
saccharide ranges from 1 % to 1 2% by weight. 
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The high molecular weight structural additive aids in preventing viral 
aggregation during freezing and provides structural support in the lyophilized or dried 
state. Within the context of the present invention, structural additives are considered to 
be of "high molecular weight" if they are greater than 5000 m.w. A preferred high 
5 molecular weight structural additive is human serum albumin. However, other 
substances may also be used, such as hydroxyethyl-cellulose, hydroxymethyl-cellulose, 
dextran, cellulose, gelatin, or povidone. A particularly preferred concentration of 
human serum albumin is 0.1% by weight. Preferably, the concentration of the high 
molecular weight structural additive ranges from 0.1% to 10% by weight. 

10 The amino acids, if present, function to further preserve viral infectivity 

upon cooling and thawing of the aqueous suspension. In addition, amino acids function 
to further preserve viral infectivity during sublimation of the cooled aqueous suspension 
and while in the lyophilized state. A preferred amino acid is arginine, but other amino 
acids such as lysine, ornithine, serine, glycine, glutamine, asparagine, glutamic acid or 

15 aspartic acid can also be used. A particularly preferred arginine concentration is 0.1% 
by weight. Preferably, the amino acid concentration ranges from 0.1% to 10%* by 
weight. 

The buffering component acts to buffer the solution by maintaining a 

relatively constant pH. A variety of buffers may be used, depending on the pH range 
20 desired, preferably between 7.0 and 7.8. Suitable buffers include phosphate buffer and 

citrate buffer. A particularly preferred pH of the recombinant virus formulation is 7.4, 

and a preferred buffer is tromethamine. 

In addition, it is preferable that the aqueous solution contain a neutral 

salt which is used to adjust the final formulated recombinant alphavirus to an 
25 appropriate iso-osmotic salt concentration. Suitable neutral salts include sodium 

chloride, potassium chloride or magnesium chloride. A preferred salt is sodium 

chloride. 

Aqueous solutions containing the desired concentration of the 
components described above may be prepared as concentrated stock solutions. 
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It will be evident to those skilled in the art, given the disclosure provided 
herein, that it may be preferable to utilize certain saccharides within the aqueous 
solution when the lyophilized virus is intended for storage at room temperature. More 
specifically, it is preferable to utilize disaccharides, such as lactose or trehalose. 
5 particularly for storage at room temperature. 

The lyophilized or dehydrated viruses of the subject invention may be 
reconstituted using a variety of substances, but are preferably reconstituted using water. 
In certain instances, dilute salt solutions which bring the final formulation to isotonicity 
may also be used. In addition, it may be advantageous to use aqueous solutions 

10 containing components known to enhance the activity of the reconstituted virus. Such 
components include cytokines, such as IL-2, polycations, such as protamine sulfate, or 
other components which enhance the transduction efficiency of the reconstituted virus. 
Lyophilized or dehydrated recombinant virus may be reconstituted with any convenient 
volume of water or the reconstituting agents noted above that allow substantial, and 

1 5 preferably total solubilization of the lyophilized or dehydrated sample. 

I- Methods for Utilizing Gene Delivery Vehicles 

As noted above, the present invention also provides methods for 

delivering a selected heterologous sequence to a vertebrate (e.g., a mammal such as a 
20 human or other warm-blooded animal such as a horse, cow, pig, sheep, dog, cat, rat or 

mouse) or insect, comprising the step of administering to a vertebrate or insect a gene 

delivery vehicle as described herein which is capable of expressing the selected 

heterologous sequence. Such gene delivery vehicles may be administered either 
- directly {e.g., intravenously, intramuscularly, intraperitoneally, subcutaneously, orally, 
25 rectally, intraocularly, intranasally), or by various physical methods such as lipofection 

(Feigner et al., Proc. Nail. Acad. Sci. USA £4:7413-7417, 1989), direct DNA injection 

(Fung et al., Proc. Natl. Acad. Sci. USA 50:353-357, 1983; Seeger et al., Proc. Natl. 

Acad. Sci. USA 57:5849-5852; Acsadi et al., Nature J52:815-818, 1991); 

microprojectile bombardment (Williams et al., PNAS 55:2726-2730, 1991); liposomes 
30 of several types (see, e.g.. Wang et al., PNAS 5*7851-7855, 1987); CaP0 4 (Dubensky 

et al., PNAS 57:7529-7533, 1984); DNA Iigand (Wu et al, J. Biol. Chem. 26V: 16985- 
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16987, 1989); administration of nucleic acids alone (WO 90/11092); or administration 
of DNA linked to killed adenovirus (Curiel et aL, Hum. Gene Ther. 3:147-154, 1992); 
via polycation compounds such as polylysine, utilizing receptor specific ligands; as well 
as with psoralen inactivated viruses such as Sendai or Adenovirus. In addition, the gene 
5 delivery vehicles may either be administered directly {i.e., in vivo), or to cells which 
have been removed (ex vivo), and subsequently returned. 

As discussed in more detail below, gene delivery vehicles may be 
administered to a vertebrate or insect for a . wide variety of therapeutic and/or other 
productive purposes, including for example, for the purpose of stimulating a specific 
1 0 immune response; inhibiting the interaction of an agent with a host cell receptor; to 
express a toxic palliative, including for example, conditional toxic palliatives; to 
immunologically regulate the immune system; to prevent cell division, to express 
markers, for replacement gene therapy, to promote wound healing and/or to produce a 
recombinant protein. These and other uses are discussed in more detail below. 

15 

1. Immunostimulation 

Within one aspect of the present invention, compositions and methods 
are provided for administering a gene delivery vehicle which is capable of preventing, 
inhibiting, stabilizing or reversing infectious, cancerous, auto- immune or immune 
20 diseases. Representative examples of such diseases include viral infections such as 
HIV, HBV, HCV, HTLV I, HTLV II, CMV, EBV and HPV, melanomas, diabetes, graft 
vs. host disease, Alzheimer's disease and heart disease. More specifically, within one 
aspect of the present invention, compositions and methods are provided for stimulating 
^ an immune response (either humoral or cell-mediated) to a pathogenic agent, such that 
25 the pathogenic agent is either killed or inhibited. Representative examples of 
pathogenic agents include bacteria, fungi, parasites, viruses and cancer cells. 

Within one embodiment of the invention the pathogenic agent is a virus, 
and methods are provided for stimulating a specific immune response and inhibiting 
viral spread by using a gene delivery vehicle that directs the expression of an antigen or 
30 modified form thereof to susceptible target cells capable of either (1) initiating an 
immune response to the viral antigen or (2) preventing the viral spread by occupying 
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cellular receptors required for viral interactions. Expression of the vector nucleic acid 
encoded protein may be transient or stable with time. Where an immune response is to 
be stimulated to a pathogenic antigen, the gene delivery vehicle is preferably designed 
to express a modified form of the antigen which will stimulate an immune response and 
5 which has reduced pathogenicity relative to the native antigen. This immune response 
is achieved when cells present antigens in the correct manner, i.e., in the context of the 
MHC class I and/or II molecules along with accessory molecules such as CD3, ICAM- 
1, ICAM-2, LFA-1, or analogues thereof (e.g., Altmann et al., Nature 338:5\2, 1989). 
Cells infected with gene delivery vehicles are expected to do this efficiently because 
1 0 they closely mimic genuine viral infection and because they: (a) are able to infect non- 
replicating cells, (b) do not integrate into the host cell genome, (c) are not associated 
with any life threatening diseases, and (d) express high levels of heterologous protein. 
Because of these differences, gene delivery vehicles can easily be thought of as safe 
viral vectors which can be used on healthy individuals for vaccine use. 
15 This aspect of the invention has a further advantage over other systems 

that might be expected to function in a similar manner, in that the presenter cells are 
fully viable and healthy, and low levels of viral antigens, relative to heterologous genes, 
are expressed. This presents a distinct advantage since the antigenic epitopes expressed 
can be altered by selective cloning of sub-fragments of the gene for the antigen into the 
20 recombinant alphavirus, leading to responses against immunogenic epitopes which may 
otherwise be overshadowed by immunodominant epitopes. Such an approach may be 
extended to the expression of a peptide having multiple epitopes, one or more of the 
epitopes being derived from different proteins. Further, this aspect of the invention 
allows efficient stimulation of cytotoxic T lymphocytes (CTL) directed against 
25 antigenic epitopes, and peptide fragments of antigens encoded by sub-fragments of 
genes, through intracellular synthesis and association of these peptide fragments with 
MHC Class I molecules. This approach may be utilized to map major 
immunodominant epitopes for CTL induction. 

An immune response may also be achieved by transferring to an 
30 appropriate immune cell (such as a T lymphocyte) the gene for the specific T cell 
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receptor which recognizes the antigen of interest (in the context of an appropriate MHC 
molecule if necessary), for an immunoglobulin which recognizes the antigen of interest, 
or for a hybrid of the two which provides a CTL response in the absence of the MHC 
context. Thus, the gene delivery vehicle cells may be used as an immunostimulant, 
5 immunomodulator, or vaccine. 

In another embodiment of the invention, methods are provided for 
producing inhibitor palliatives wherein gene delivery vehicles deliver and express 
defective interfering viral structural proteins, which inhibit viral assembly. Such gene 
delivery vehicles may encode defective gag, pol, env or other viral particle proteins or 

10 peptides and these would inhibit in a dominant fashion the assembly of viral particles. 
This occurs because the interaction of normal subunits of the viral particle is disturbed 
by interaction with the defective subunits. 

In another embodiment of the invention, methods are provided for the 
expression of inhibiting peptides or proteins specific for viral protease. Briefly, viral 

1 5 protease cleaves the viral gag and gaglpol proteins into a number of smaller peptides. 
Failure of this cleavage in all cases leads to complete inhibition of production* of 
infectious retroviral particles. As an example, the HIV protease is known to be an 
aspartyl protease and these are known to be inhibited by peptides made from amino 
acids from protein or analogues. Gene delivery vehicles to inhibit HIV will express one 

20 or multiple fused copies of such peptide inhibitors. 

Another embodiment involves the delivery of suppressor genes which, 
when deleted, mutated, or not expressed in a cell type, lead to tumorigenesis in that cell 
type. Reintroduction of the deleted gene by means of a gene delivery vehicle leads to 
* regression of the tumor phenotype in these cells. Examples of such cancers are 

25 retinoblastoma and Wilms Tumor. Since malignancy can be considered to be an 
inhibition of cellular terminal differentiation compared with cell growth, the alphavirus 
vector delivery and expression of gene products which lead to differentiation of a tumor 
should also, in general, lead to regression. 

In yet another embodiment, the gene delivery vehicle provides a 

30 therapeutic effect by transcribing a ribozyme (an RNA enzyme) (HaselofT and Gerlach, 
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Nature 35*585, 198?) which will cleave and hence inactivate RNA molecules 
corresponding to a pathogenic function. Since ribozymes function by recognizing a 
specific sequence in the target RNA and this sequence is normally 12 to 17 bp, this 
allows specific recognition of a particular RNA species such as a RNA or a retroviral 
5 genome. Additional specificity may be achieved in some cases by making this a 
conditional toxic palliative (see below). 

One way of increasing the effectiveness of inhibitory palliatives is to 
express viral inhibitory genes in conjunction with the expression of genes which 
increase the probability of infection of the resistant cell by the vims in question. The 

10 result is a nonproductive "dead-end" event which would compete for productive 
infection events. In the specific case of HIV, gene delivery vehicles may be delivered 
which inhibit HIV replication (by expressing anti-sense tat, etc., as described above) 
and also overexpress proteins required for infection, such as CD4. In this way, a 
relatively small number of vector-infected HIV-resistant cells act as a "sink" or 

15 "magnet" for multiple nonproductive fusion events with free virus or virally infected 
cells. 

2. Blocking Agents 

Many infectious diseases, cancers, autoimmune diseases, and other 
20 diseases involve the interaction of viral particles with cells, cells with cells, or cells with 
factors produced by themselves or other cells. In viral infections, viruses commonly 
enter cells via receptors on the surface of susceptible cells. In cancers or other 
proliferative conditions (e.g., restenosis), cells may respond inappropriately or not at all 
* to signals from other cells or factors, or specific factors may be mutated, overexpressed, 
25 or underexpressed, resulting in loss of appropriate cell cycle control. In autoimmune 
disease, there is inappropriate recognition of "self 1 markers. Within the present 
invention, such interactions may be blocked by producing, in v/vo, an analogue to either 
of the partners in an interaction. Alternatively, cell cycle control may be restored by 
preventing the transition from one phase to another (e.g., Gl to S phase) using a 
30 blocking factor which is absent or underexpressed. This blocking action may occur 
intracellularly, on the cell membrane, or extracellularly, and the action of an alphavirus 
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vector carrying a gene for a blocking agent, can be mediated either from inside a 
susceptible cell or by secreting a version of the blocking protein to locally block the 
pathogenic interaction. 

In the case of HIV, the two agents of interaction are the gp 120/gp 41 
5 envelope protein and the CD4 receptor molecule. Thus, an appropriate blocker would 
be a gene delivery vehicle expressing either an HIV env analogue that blocks HIV entry 
without causing pathogenic effects, or a CD4 receptor analogue. The CD4 analogue 
. would be secreted and would function to protect neighboring cells, while the gp 120/gp 
41 is secreted or produced only intracellular^ so as to protect only the vector- 

10 containing cell. It may be advantageous to add human immunoglobulin heavy chains or 
other components to CD4 in order to enhance stability or complement lysis. 
Administration of a gene delivery vehicle encoding such a hybrid-soluble CD4 to a host 
results in a continuous supply of a stable hybrid molecule. Efficacy of treatment can be 
assayed by measuring the usual indicators of disease progression, including antibody 

15 level, viral antigen production, infectious HFV levels, or levels of nonspecific 
infections. 

In the case of uncontrolled proliferative states, such as cancer or 
restenosis, cell cycle progression may be halted by the expression of a number of 
different factors that affect signaling by cyclins or cyclin-dependent kinases (CDK). 

20 For example, the cyclin-dependent kinase inhibitors, pi 6, p21, and p27 each regulate 
cycIinrCDK mediated cell cycle signaling. Overexpression of these factors within a cell 
by a gene delivery vehicle results in a cytostatic suppression of cell proliferation. Other 
factors that may be used therapeutically, as blocking agents or targets to disrupt cell 
* proliferation, include, for example, wild-type or mutant Rb, p53, Myc, Fos, Jun, PCNA, 

25 GAX, lenti viral vpr and pi 5. Within related embodiments, cardiovascular diseases such 
as restenosis or atherosclerosis may be treated or prevented with vectors that express 
products which promote re-endothelialization, or vascular remodeling (e.g., VEGF, 
TFPI, SOD). 
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3. Expression of Palliatives 

Techniques similar to those described above can be used to produce gene 
delivery vehicles which direct the expression of an agent (or "palliative") which is 
capable of inhibiting a function of a pathogenic agent or gene. Within the present 
5 invention, "capable of inhibiting a function" means that the palliative either directly 
inhibits the function or indirectly does so, for example, by converting an agent present 
in the cells from one which would not normally inhibit a function of the pathogenic 
agent to one which does. Examples of such functions for viral diseases include 
adsorption, replication, gene expression, assembly, and exit of the virus from infected 
1 0 cells. Examples of such functions for a cancerous cell, cancer-promoting growth factor, 
or uncontrolled proliferative condition (<?.#, restenosis) include viability, cell 
replication, altered susceptibility to external signals (e.g., contact inhibition), and lack 
of production or production of mutated forms of anti-oncogene proteins. 

15 a. Inhibitor Palliatives 

In one aspect of the present invention, the gene delivery vehicle directs 
the expression of a gene which can interfere with a function of a pathogenic agent, for 
instance in viral or malignant diseases. Such expression may either be essentially 
continuous or in response to the presence in the cell of another agent associated either 

20 with the pathogenic condition or with a specific cell type (an "identifying agent"). In 
addition, vector delivery may be controlled by targeting vector entry specifically to the 
desired cell type (for instance, a virally infected or malignant cell) as discussed above. 

One method of administration is leukophoresis, in which about 20% of 
-an individual's PBLs are removed at any one time and manipulated in vitro. Thus, 
25 approximately 2x10* cells may be treated and replaced. Repeat treatments may also be 
performed. Alternatively, bone marrow may be treated and allowed to amplify the 
effect as described above. In addition, packaging cell lines producing a vector may be 
directly injected into a subject, allowing continuous production of recombinant virions. 

In one embodiment, gene delivery vehicles which express RNA 
30 complementary to key pathogenic gene transcripts (for example, a viral gene product or 
an activated cellular oncogene) can be used to inhibit translation of that transcript into 
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protein, such as the inhibition of translation of the HIV tat protein. Since expression of 
this protein is essential for viral replication, cells containing the gene delivery vehicle 
would be resistant to HIV replication^ 

In a second embodiment, where the pathogenic agent is a single-stranded 
5 virus having a packaging signal, RNA complementary to the viral packaging signal 
{e.g., an HIV packaging signal when the palliative is directed against HIV) is expressed, 
so that the association of these molecules with the viral packaging signal will, in the 
case of retroviruses, inhibit stem loop formation or tRNA primer binding required for 
proper encapsidation or replication of the alphavirus RNA genome. 

10 In a third embodiment, a gene delivery vehicle may be introduced which 

expresses a palliative capable of selectively inhibiting the expression of a pathogenic 
gene, or a palliative capable of inhibiting the activity of a protein produced by the 
pathogenic agent. In the case of HIV, one example is a mutant tat protein which lacks 
the ability to transactivate expression from the HIV LTR and interferes (in a 

15 transdominant manner) with the normal functioning of tat protein. Such a mutant has 
been identified for HTLV II tat protein ("XII Leu 5 " mutant; see Wachsman etal., 
Science 255:674, 1987). A mutant transrepressor tat should inhibit replication much as 
has been shown for an analogous mutant repressor in HSV-1 (Friedmann et al., Nature 
JJ5:452, 1988). 

20 Such a transcriptional repressor protein can be selected for in tissue 

culture using any viral-specific transcriptional promoter whose expression is stimulated 
by a virus-specific transactivating protein (as described above). In the specific case of 
HIV, a cell line expressing HIV tat protein and the HSVTK gene driven by the HIV 
. promoter will die in the presence of AC V. However, if a series of mutated tat genes are 

25 introduced to the system, a mutant with the appropriate properties (i.e., represses 
transcription from the HIV promoter in the presence of wild-type tat) will grow and be 
selected. The mutant gene can then be reisolated from these cells. A cell line 
containing multiple copies of the conditionally lethal vector/tat system may be used to 
assure that surviving cell clones are not caused by endogenous mutations in these genes. 

30 A battery of randomly mutagenized tat genes are then introduced into these cells using a 
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"rescuable" alphavirus vector (i.e., one that expresses the mutant tat protein and 
contains a bacterial origin of replication and drug resistance marker for growth and 
selection in bacteria). This allows a large number of random mutations to be evaluated 
and permits facile subsequent molecular cloning of the desired mutant cell line. This 
5 procedure may be used to identify and utilize mutations in a variety of viral 
transcriptional activator/viral promoter systems for potential antiviral therapies. 

b. Conditional Toxic Palliatives 

Another approach for inhibiting a pathogenic agent is to express a 
10 palliative which is toxic for the cell expressing the pathogenic condition. In this case, 
expression of the palliative from the gene delivery vehicle should be limited by the 
presence of an entity associated with the pathogenic agent, such as a specific viral RNA 
sequence identifying the pathogenic state, in order to avoid destruction of 
nonpathogenic cells. 

15 In one embodiment of this method, a gene delivery vehicle can be 

utilized to express a toxic gene (as discussed above) from a cell-specific responsive 
vector. In this manner, rapidly replicating cells, which contain the RNA sequences 
capable of activating the cell-specific responsive vectors, are preferentially destroyed by 
the cytotoxic agent produced by the gene delivery vehicle. 

20 In a similar manner to the preceding embodiment, the gene delivery 

vehicle can carry a gene for phosphorylation, phosphoribosylation, ribosylation, or 
other metabolism of a purine- or pyrimidine-based drug. This gene may have no 
equivalent in mammalian cells and might come from organisms such as a virus, 
- bacterium, fungus, or protozoan. An example of this would be the £ coli guanine 

25 phosphoribosyl transferase gene product, which is lethal in the presence of thioxanthine 
(see Besnard etal., Mol. Cell. Biol. 7:4139-4141, 1987). Conditionally lethal gene 
products of this type (also referred to as "prodrugs converting enzymes" above) have 
application to many presently known purine- or pyrimidine-based anticancer drugs, 
which often require intracellular ribosylation or phosphorylation in order to become 

30 effective cytotoxic agents. The conditionally lethal gene product could also metabolize 
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a nontoxic drug which is not a purine or pyrimidine analogue to a cytotoxic form (see 
Searle et al., Brit. 1 Cancer 53:377-384, 1986). 

Mammalian viruses in general tend to have "immediate early" genes 
which are necessary for subsequent transcriptional activation from other viral promoter 
5 elements. RNA sequences of this nature are excellent candidates for activating 
alphavirus vectors intracellular signals (or "identifying agents") of viral infection. 
Thus, conditionally lethal genes expressed from alphavirus cell-specific vectors 
responsive to these viral "immediate early" gene products could specifically kill cells 
infected with any particular virus. Additionally, since the human and interferon 

10 promoter elements are transcriptionally activated in response to infection by a wide 
variety of nonrelated viruses, the introduction of vectors expressing a conditionally 
lethal gene product like HSVTK. for example, in response to interferon production 
could result in the destruction of cells infected with a variety of different viruses. 

In another aspect of the present invention, gene delivery vehicles are 

15 provided which direct the expression of a gene product capable of activating an 
otherwise inactive precursor into an active inhibitor of the pathogenic agent. For 
example, the HSVTK gene product may be used to more effectively metabolize poten- 
tially antiviral nucleoside analogues such as AZT or ddC. The HSVTK gene may be 
expressed under the control of a cell-specific responsive vector and introduced into 

20 these cell types. AZT (and other nucleoside antivirals) must be metabolized by cellular 
mechanisms to the nucleotide triphosphate form in order to specifically inhibit retroviral 
reverse transcriptase, and thus, HIV replication (Furmam et al M Proc. Natl Acad. Set 
USA 55:8333-8337, 1986). Constitutive expression of HSVTK (a nucleoside and 
- nucleoside kinase with very broad substrate specificity) results in more effective 

25 metabolism of these drugs to their biologically active nucleotide triphosphate form. 
AZT or ddC therapy will thereby be more effective, allowing lower doses, less 
generalized toxicity, and higher potency against productive infection. Additional 
nucleoside analogues whose nucleotide triphosphate forms show selectivity for 
retroviral reverse transcriptase but, as a result of the substrate specificity of cellular 
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nucleoside and nucleotide kinases are not phosphorylated, will be made more 
efficacious. 

Administration of these gene delivery vehicles to human T cell and 
macrophage/monocyte cell lines can increase their resistance to HIV in the presence of 
5 AZT and ddC compared to the same cells without retroviral vector treatment. 
Treatment with AZT would be at lower than normal levels to avoid toxic side effects 
but still efficiently inhibit the spread of HIV. The course of treatment would be as 
described for the blocker. 

In one embodiment, the gene delivery vehicle carries a gene specifying a 

1 0 product which is not in itself toxic but, when processed or modified by a protein such as 
a protease specific to a viral or other pathogen, is converted into a toxic form. For 
example, the gene delivery vehicle could carry a gene encoding a proprotein for ricin A 
chain, which becomes toxic upon processing by the HIV protease. More specifically, a 
synthetic inactive proprotein form of the toxin ricin or diphtheria A chains could be 

15 cleaved to the active form by arranging for the HIV virally encoded protease to 
recognize and cleave off an appropriate "pro" element. 

In another embodiment, the gene delivery vehicle may express a 
"reporting product" on the surface of the target cells in response to the presence of an 
identifying agent in the cells (such as expression of a viral gene). This surface protein 

20 can be recognized by a cytotoxic agent, such as antibodies for the reporting protein, or 
by cytotoxic T cells. In a similar manner, such a system can be used as a detection 
system (see below) to simply identify those cells having a particular gene which 
expresses an identifying protein. 

Similarly, in another embodiment, a surface protein could be expressed 

25 which would itself be therapeutically beneficial. In the particular case of HIV, 
expression of the human CD4 protein specifically in HIV-infected cells may be 
beneficial in two ways: 

1. Binding of CD4 to HIV env intracellularly could inhibit the 
formation of viable viral particles, much as soluble CD4 has been shown to do for free 
30 virus, but without the problem of systematic clearance and possible immunogenicity, 
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since the protein will remain membrane bound and is structurally identical to 
endogenous CD4 (to which the patient should be immunologically tolerant). 

2. Since the CD4/HIV env complex has been implicated as a cause 
of cell death, additional expression of CD4 (in the presence of excess HIV-ewv present 
5 in HIV-infected cells) leads to more rapid cell death and thus inhibits viral 
dissemination. This may be particularly applicable to monocytes and macrophages, 
which act as a reservoir for virus production as a result of their relative refractility to 
HIV-induced cytotoxicity (which, in turn, is apparently due to the relative lack of CD4 
on their cell surfaces). 

10 In another embodiment, the gene delivery vehicle can provide a 

ribozyme which will cleave and inactivate RNA molecules essential for viability of the 
vector infected cell. By making ribozyme production dependent on a specific RNA 
sequence corresponding to the pathogenic state, such as HIV tat, toxicity is specific to 
the pathogenic state. 

15 

4. Expression of Markers 

The above-described technique of expressing a palliative in a cell in 
response to a specific RNA sequence can also be modified to enable detection of a 
particular gene in a cell which expresses an identifying protein (for example, a gene 
20 carried by a particular virus), and hence enable detection of cells carrying that virus. In 
addition, this technique enables the detection of viruses (such as HIV) in a clinical 
sample of cells carrying an identifying protein associated with the virus. 

This modification can be accomplished by providing a genome coding 
- for a product, the presence of which can be readily identified (the "marker product"), in 
25 a gene delivery vehicle which responds to the presence of the identifying protein in the 
infected cells. For example, HIV, when it infects suitable cells, makes tat and rev. The 
indicator cells can thus be provided with a genome (such as by infection with an 
appropriate recombinant alphavirus) which codes for a marker gene, such as the alkaline 
phosphatase gene, P-galactosidase gene, or the luciferase gene which is expressed by 
30 the recombinant alphavirus upon activation by the tat and/or rev RNA transcript. In the 
case of P-galactosidase or alkaline phosphatase, exposing the cells to substrate 
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analogues results in a color or fluorescence change if the sample is positive for HIV. In 
the case of luciferase, exposing the sample to luciferin will result in luminescence if the 
sample is positive for HIV. For intracellular enzymes such as P-galactosidase, the viral 
titre can be measured directly by counting colored or fluorescent cells, or by making 
5 cell extracts and performing a suitable assay. For the membrane bond form of alkaline 
phosphatase, virus titre can also be measured by performing enzyme assays on the cell 
surface using a fluorescent substrate. For secreted enzymes, such as an engineered form 
of alkaline phosphatase, small samples of culture supernatant are assayed for activity, 
allowing continuous monitoring of a single culture over time. Thus, different forms of 
10 this marker system can be used for different purposes. These include counting active 
virus, or sensitively and simply measuring viral spread in a culture and the inhibition of 
this spread by various drugs. 

Further specificity can be incorporated into the preceding system by 
testing for the presence of the virus either with or without neutralizing antibodies to that 
15 virus. For example, in one portion of the clinical sample being tested, neutralizing 
antibodies to HIV may be present; whereas in another portion there would be no 
neutralizing antibodies. If the tests were negative in the system where there were 
antibodies and positive where there were no antibodies, this would assist in confirming 
the presence of HIV. 

20 Within an analogous system for an in vitro assay, the presence of a 

particular gene, such as a viral gene, may be determined in a cell sample. In this case, 
the cells of the sample are infected with a suitable gene delivery vehicle which carries 
the reporter gene which is only expressed in the presence of the appropriate viral RNA 
* transcript. The reporter gene, after entering the sample cells, will express its reporting 

25 product (such as P-galactosidase or luciferase) only if the host cell expresses the 
appropriate viral proteins. 

These assays are more rapid and sensitive, since the reporter gene can 
express a greater amount of reporting product than identifying agent present, which 
results in an amplification effect. 
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5. Immune Down-Regulation 

As described above, the present invention also provides gene delivery 
vehicles capable of suppressing one or more elements of the immune system in target 
cells infected with the alphavirus. Briefly, specific down-regulation of inappropriate or 
5 unwanted immune responses, such as in chronic hepatitis or in transplants of 
heterologous tissue such as bone marrow, can be engineered using imrnune-suppressive 
viral gene products which suppress surface expression of transplantation (MHC) 
antigen. Group C adenoviruses Ad2 and Ad5 possess a 19 kd glycoprotein (gp 19) 
encoded in the E3 region of the virus. This gp 19 molecule binds to class I MHC 

10 molecules in the endoplasmic reticulum of cells, and prevents terminal glycosylation 
and translocation of class I MHC to the cell surface. For example, prior to bone marrow 
transplantation, donor bone marrow cells may be infected with a gp 1 9-encoding gene 
delivery vehicle which, upon expression of the gp 19, inhibit the surface expression of 
MHC class I transplantation antigens. These donor cells may be transplanted with low 

15 risk of graft rejection and may require a minimal immunosuppressive regimen for the 
transplant patient. This may allow an acceptable donor-recipient chimeric state to exist 
with fewer complications. Similar treatments may be used to treat the range of so- 
called autoimmune diseases, including lupus erythromiatis, multiple sclerosis, 
rheumatoid arthritis or chronic hepatitis B infection. 

20 An alternative method involves the use of anti-sense message, ribozyme, 

or other specific gene expression inhibitor specific for T cell clones which are 
autoreactive in nature. These block the expression of the T cell receptor of particular 
unwanted clones responsible for an autoimmune response. The anti-sense, ribozyme, or 
- other gene may be introduced using the viral vector delivery system. 

25 

6. Replacement or Augmentation Gene Therapy 

One further aspect of the present invention relates to transforming cells 
of a vertebrate or insect with a gene delivery vehicle which supplies genetic sequences 
capable of expressing a therapeutic protein. Within one embodiment of the present 
30 invention, the gene delivery vehicle is designed to express a therapeutic protein capable 
of preventing, inhibiting, stabilizing or reversing an inherited or noninherited genetic 
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defect in metabolism, immune regulation, hormonal regulation, enzymatic or membrane 
associated structural function. This embodiment also describes the gene delivery 
vehicle capable of transducing individual cells, whereby the therapeutic protein is able 
to be expressed systemically or locally from a specific cell or tissue, whereby the 
5 therapeutic protein is capable of (a) the replacement of an absent or defective cellular 
protein or enzyme, or (b) supplement production of a defective of low expressed 
cellular protein or enzyme. Such diseases may include cystic fibrosis, Parkinson's 
disease, hypercholesterolemia, adenosine deaminase deficiency, B-globin disorders, 
Hemophilia A & B, Gaucher's disease, diabetes and leukemia. 

10 As an example of the present invention, a gene delivery vehicle can be 

constructed and utilized to treat Gaucher disease. Briefly, Gaucher disease is a genetic 
disorder that is characterized by the deficiency of the enzyme glucocerebrosidase. This 
type of therapy is an example of a single gene replacement therapy by providing a 
functional cellular enzyme. This enzyme deficiency leads to the accumulation of 

15 glucocerebroside in the lysosomes of all cells in the body. However, the disease 
phenotype is manifested only in the macrophages, except in the very rare neuronpathic 
forms of the disease. The disease usually leads to enlargement of the liver and spleen 
and lesions in the bones. (For a review, see Science 256:794, 1992, and The Metabolic 
Basis of Inherited Disease, 6th ed., Scriver et al., vol. 2, p. 1677). 

20 Gene delivery vehicles can similarly be utilized to deliver a wide variety 

of therapeutic proteins in order to treat, cure, prevent a disease or disease process. 
Representative examples of such genes include, but are not limited to, insulin (see U.S. 
4,431,740 and BE 885196A), hemoglobin (Lawn et al. Cell 27:647-51, 1980), 
' erythropoietin (EPO; see U.S. 4,703,008), megakaryocyte growth and differentiation 

25 factor (MGDF), stem cell factor (SCF), G-CSF (Nagata et aL, Nature 5/9:415-418, 
1986), GM-CSF, M-CSF (see WO 8706954), the flt3 ligand (Lyman et al. (1993), Cell 
75:1157-1167), EGF, acidic and basic FGF, PDGF, members of the interleukin or 
interferon families, supra, neurotropic factors (e.g., BDNF; Rosenthal et al., 
Endocrinology 729:1289-1294, 1991, NT-3; see WO 9103569, CNTF; see WO 

30 9104316, NGF; see WO 9310150), coagulation factors (e.g., factors VIII and IX), 
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thrombolytic factors such as t-PA (see EP 292009, AU 8653302 and EP 174835) and 
streptokinase (see EP 407942), human growth hormone (see JP 94030582 and U.S. 
4,745,069) and other animal somatotropins, integrins and other cell adhesion molecules, 
such as ICAM-1 and ELAM (see also other "heterologous sequences" discussed above), 
5 and other growth factors, such as IGF-I and IGF-II, TGF-p, osteogenic protein- 1 
(Ozkaynak et al., EMBOJ. P:2085-2093, 1990), and other bone morphogenetic proteins 
(e.g., BMP-4, Nakase et al, J. Bone Miner. Res, 9:65 1 -659, 1 994). 

7. Lvmphokines and Lvmnhokine Receptors 
10 As noted above, the present invention also provides gene delivery 

vehicles which can, among other functions, direct the expression of one or more 
cytokines or cytokine receptors. Briefly, in addition to their role as cancer therapeutics, 
cytokines can have negative effects resulting in certain pathological conditions. For 
example, most resting T-cells, B cells, large granular lymphocytes and monocytes do 

15 not express IL-2R (receptor). In contrast to the lack of IL-2R expression on normal 
resting cells, IL-2R is expressed by abnormal cells in patients with certain leukemias 
(ATL, Hairy-cell, Hodgkins, acute and chronic granulocytic), autoimmune diseases, and 
is associated with allograft rejection. Interestingly, in most of these patients the serum 
concentration of a soluble form of IL-2R is elevated. Therefore, with certain 

20 embodiments of the invention therapy may be effected by increasing the serum 
concentration of the soluble form of the cytokine receptor. For example, in the case of 
IL-2R, a gene delivery vehicle can be engineered to produce both soluble IL-2R and IL- 
2R, creating a high affinity soluble receptor. In this configuration, serum IL-2 levels 
. would decrease, inhibiting the paracrine loop. This same strategy also may be effective 

25 against autoimmune diseases. In particular, because some autoimmune diseases (eg., 
Rheumatoid arthritis, SLE) also are associated with abnormal expression of IL-2, 
blocking the action of IL-2 by increasing the serum level of receptor may also be 
utilized in order to treat such autoimmune diseases. 

In other cases inhibiting the levels of IL-1 may be beneficial. Briefly, 

30 IL-1 consists of two polypeptides, IL-1 and IL-1, each of which has plieotropic effects. 
IL-1 is primarily synthesized by mononuclear phagocytes, in response to stimulation by 
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microbial products or inflammation. There is a naturally occurring antagonist of the 
IL-1R, referred to as the IL-1 Receptor antagonist ("IL-IRa"). This IL-1R antagonist 
has the same molecular size as mature IL-1 and is structurally related to it. However, 
binding of IL-IRa to the IL-1R does not initiate any receptor signaling. Thus, this 
5 molecule has a different mechanism of action than a soluble receptor, which complexes 
with the cytokine and thus prevents interaction with the receptor. IL-1 does not seem to 
play an important role in normal homeostasis. In animals, antibodies to IL-1 receptors 
reduce inflammation and anorexia due to endotoxins and other inflammation inducing 
agents. 

0 In the case of septic shock, IL-1 induces secondary compounds which 

are potent vasodilators. In animals, exogenously supplied IL-1 decreases mean arterial 
pressure and induces leukopenia. Neutralizing antibody to IL-1 reduced endotoxin- 
induced fever in animals. In a study of patients with septic shock who were treated with 
a constant infusion of IL-1 R for three days, the 28 day mortality was 16% compared to 
15 44% in patients who received placebo infusions. In the case of autoimmune disease, 
reducing the activity of IL-1 reduces inflammation. Similarly, blocking the activity of 
IL-1 with recombinant receptors can result in increased allograft survival in animals, 
again presumably by decreasing inflammation. 

These diseases provide further examples where gene delivery vehicles 
20 may be engineered to produce a soluble receptor or more specifically the IL-IRa 
molecule. For example, in patients undergoing septic shock, a single injection of IL- 
lRa producing vector particles could replace the current approach requiring a constant 
infusion of recombinant IL-1R. 

Cytokine responses, or more specifically, incorrect cytokine responses 
25 may also be involved in the failure to control or resolve infectious diseases. Perhaps the 
best studied example is non-healing forms of leishmaniasis in mice and humans which 
have strong, but counterproductive T H 2-dominated responses. Similarly, 
lepromotomatous leprosy is associated with a dominant, but inappropriate T H 2 response. 
In these conditions, gene delivery vehicles may be useful for increasing circulating 
30 levels of IFN gamma, as opposed to the site-directed approach proposed for solid tumor 
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therapy. I FN gamma is produced by T H -1 T-cells, and functions as a negative regulator 
of T ir 2 subtype proliferation. IFN gamma also antagonizes many of the IL-4 mediated 
effects on B-cells, including isotype switching to IgE. 

IgE, mast cells and eosinophils are involved in mediating allergic 
5 reaction. IL-4 acts on differentiating T-cells to stimulate T H -2 development, while 
inhibiting T H -1 responses. Thus, alphavirus-based gene therapy may also be 
accomplished in conjunction with traditional allergy therapeutics. One possibility is to 
deliver a gene delivery vehicle which produces IL4R with small amounts of the 
offending allergen {i.e., traditional allergy shots). Soluble IL-4R would prevent the 
1 0 activity of IL-4, and thus prevent the induction of a strong T H -2 response. 

8. Suicide Vectors 

One further aspect of the present invention relates to the use of gene 
delivery vehicle suicide vectors to limit the spread of wild-type alphavirus in the 

15 packaging/producer cell lines. Briefly, within one embodiment the gene delivery 
vehicle is comprised of an antisense or ribozyme sequence specific for the wild-type 
alphavirus sequence generated from an RNA recombination event between the 3* 
sequences of the junction region of the vector, and the 5' alphavirus structural sequences 
of the packaging cell line expression vector. The antisense or ribozyme molecule would 

20 only be thermostable in the presence of the specific recombination sequence and would 
not have any other effect in the alphavirus packaging/producer cell line. Alternatively, 
a toxic molecule (such as those disclosed herein), may also be expressed in the context 
of a vector that would only express in the presence of wild-type alphavirus. 

25 9. Gene Delivery Vehicles to Prevent the Spread of Metastatic Tumors 

One further aspect of the present invention relates to the use of gene 

delivery vehicles for inhibiting or reducing the invasiveness of malignant neoplasms. 

Briefly, the extent of malignancy typically relates to vascularization of the tumor. One 

cause for tumor vascularization is the production of soluble tumor angiogenesis factors 
30 (TAF) (Paweletzetal., Crit, Rev. Oncol Hematoi 9:197, 1989) expressed by some 

tumors. Within one aspect of the present invention, tumor vascularization may be 
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slowed utilizing gene delivery vehicles to express antisense or ribozyme RNA 
molecules specific for TAF. Alternatively, anti-angiogenesis factors (Moses etal., 
Science 245:1408, 1990; Shapiro et al., PNAS 84:2238, 1987) may be expressed either 
alone or in combination with the above-described ribozymes or antisense sequences in 
5 order to slow or inhibit tumor vascularization. Alternatively, gene delivery vehicles can 
also be used to express an antibody specific for the TAF receptors on surrounding 
tissues. 

10. Administration of Gene Delivery Vehicles 

10 Within other aspects of the present invention, methods are provided for 

administering a gene delivery vehicle to a vertebrate or insect. Briefly, the final mode 
of gene delivery vehicle administration usually relies on the specific therapeutic 
application, the best mode of increasing vector potency, and the most convenient route 
of administration. Generally, this embodiment includes gene delivery vehicles which 

15 can be designed to be delivered by, for example, (1) direct injection into the blood 
stream; (2) direct injection into a specific tissue or tumor; (3) oral administration; 
(4) nasal inhalation; (5) direct application to mucosal tissues; or (6) ex vivo 
administration of transduced autologous cells into the vertebrate or insect. Within 
certain embodiments of the invention, for ex vivo applications cells can be first removed 

20 from a host, positively and/or negatively selected in order to yield a population of cells 
which is at least partially purified {e.g., CD34 + stem cells, T cells, or the like), 
transduced, transfected, or, infected with one of the gene delivery vehicles of the 
present invention, and reintroduced into either the same host or another individual. 

Thus, the therapeutic gene delivery vehicle can be administered in such a 

25 fashion such that the vector can (a) transduce a normal healthy cell and transform the 
cell into a producer of a therapeutic protein or agent which is secreted systemically or 
locally, (b) transform an abnormal or defective cell, transforming the cell into a normal 
functioning phenotype, (c) transform an abnormal cell so that it is destroyed, and/or 
(d) transduce cells to manipulate the immune response. 

30 
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11. Modulation of Transcription Factor Activity 

In yet another embodiment, gene delivery vehicles may be utilized in 
order to regulate the growth control activity of transcription factors in the infected cell. 
Briefly, transcription factors directly influence the pattern of gene expression through 
5 sequence-specific f/ww-activation or repression (Karin, New Biologist 27:126-131, 
1990). Thus, it is not surprising that mutated transcription factors represent a family of 
oncogenes. Gene delivery vehicles can be used, for example, to return control to tumor 
cells whose unregulated growth is activated by oncogenic transcription factors, and 
proteins which promote or inhibit the binding cooperatively in the formation of homo- 

1 0 and heterodimer trans-acli vating or repressing transcription factor complexes. 

One method for reversing cell proliferation would be to inhibit the 
//ww-activating potential of the c-myc/Max heterodimer transcription factor complex. 
Briefly, the nuclear oncogene c-myc is expressed by proliferating cells and can be 
activated by several distinct mechanisms, including retroviral insertion, amplification, 

15 and chromosomal translocation. The Max protein is expressed in quiescent cells and, 
independently of c-myc, either alone or in conjunction with an unidentified factor, 
functions to repress expression of the same genes activated by the myc/Max 
heterodimer (Cole, Cell 65:715-716, 1991). 

Inhibition of c-myc or c-myclMzx proliferation of tumor cells may be 

20 accomplished by the overexpression of Max in target cells controlled by gene delivery 
vehicles. The Max protein is only 160 amino acids (corresponding to 480 nucleotide 
RNA length) and is easily incorporated into a gene delivery vehicle either 
independently, or in combination with other genes and/or antisense/ribozyme moieties 
targeted to factors which release growth control of the cell. 

25 Modulation of homo/hetero-complex association is another approach to 

control transcription factor activated gene expression. For example, transport from the 
cytoplasm to the nucleus of the /ra/w-activating transcription factor NF-B is prevented 
while in a heterodimer complex with the inhibitor protein IB. Upon induction by a 
variety of agents, including certain cytokines, IB becomes phosphorylated and NF-B is 

30 released and transported to the nucleus, where it can exert its sequence-specific 
/rww-activating function (Baeuerle and Baltimore, Science 242:540-546, 1988). The 
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dissociation of the NF-B/IB complex can be prevented by masking with an antibody the 
phosphorylation site of IB. This approach would effectively inhibit the //ww-activation 
activity of the NF-IB transcription factor by preventing its transport to the nucleus. 
Expression of the IB phosphorylation site specific antibody or protein in target cells 
5 may be accomplished with an alphavirus gene transfer vector. An approach similar to 
the one described here could be used to prevent the formation of the /row-activating 
transcription heterodimer factor AP-1 (Turner and Tijan, Science 2^/5:1689-1694. 
1989), by inhibiting the association between the jun and fos proteins. 

10 12. Production of Recombinant Proteins 

In another aspect of the present invention, togavirus (including 
alphavirus) gene delivery vehicles can be utilized to direct the expression of one or 
more recombinant proteins in eukaryotic cells {ex vivo, in vivo, or established cell lines). 
As used herein, a "recombinant protein" refers to a protein, polypeptide, enzyme, or 

1 5 fragment thereof. Using this approach, proteins having therapeutic or other commercial 
application can be more cost-effectively produced. Furthermore, proteins produced in 
eukaryotic cells may be more authentically modified post-translationally (e.g., 
glycosylated, sulfated, acetylated, etc.), as compared to proteins produced in prokaryotic 
cells. In addition, such systems may be employed in the in vivo production of various 

20 chemical compounds, e.g., fine or specialty chemicals. 

Within this aspect, the gene delivery vehicle encoding the desired 
protein, enzyme, or enzymatic pathway (as may be required for the production of a 
desired chemical) is transformed, transfected, transduced or otherwise introduced into a 
suitable eukaryotic cell. Representative examples of proteins which can be produced 

25 using such a system include, but are not limited to, insulin {see U.S. 4,43 1 ,740 and BE 
885196A), hemoglobin (Lawn et al., Cell 2/:647-51, 1980), erythropoietin (EPO; see 
U.S. 4,703,008), megakaryocyte growth and differentiation factor (MGDF), stem cell 
factor (SCF), G-CSF (Nagata et al., Nature 319:4 1 5-418, 1 986), GM-CSF, M-CSF {see 
WO 8706954), the flti ligand (Lyman et al. (1993), Cell 75:1157-1 167), EGF, acidic 

30 and basic FGF, PDGF, members of the interleukin or interferon families, supra, 
neurotropic factors {e.g., BDNF; Rosenthal et al., Endocrinology 729:1289-1294, 1991, 
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NT-3; see WO 9103569, CNTF; see WO 9104316, NGF; see WO 9310150), 
coagulation factors {e.g., factors VIII and IX), thrombolytic factors such as t-PA (see EP 
292009, AU 8653302 and EP 174835) and streptokinase (see EP 407942), human 
growth hormone (see JP 94030582 and U.S. 4,745,069) and other animal 

5 somatotropins, integrins and other cell adhesion molecules, such as ICAM-1 and EL AM 
(see also other "heterologous sequences" discussed above), and other growth factors, 
such as IGF-I and IGF-II, TGF-p, osteogenic protein- 1 (Ozkaynak et al., EMBO J. 
9:2085-2093, 1990), and other bone morphogenetic proteins (e.g., BMP-4, Nakase et al, 
1 Bone Miner, Res. 9:651-659, 1994). As those in the art will appreciate, once 

0 characterized, any gene can be readily cloned into gene delivery vehicles according to 
the present invention, followed by introduction into a suitable host cell and expression 
of the desired gene. 

Methods for producing recombinant proteins using the vectors and 
alphavirus packaging cell lines described herein are provided (see examples 6 and 7). 
Briefly, gene delivery vehicles, in the form of in vitro transcribed RNA, plasmid DNA, 
or recombinant vector particles, which encode recombinant proteins, may be introduced 
(via transfection or infection) into alphavirus packaging cell lines (PCLs) such that only 
a small fraction of the cultured cells (<1%) contain vector molecules. Vector replicons 
are packaged by the sPs, supplied in trans by the PCL, following vector RNA 
amplification, which proceeds according to the Sindbis virus replication strategy. In 
turn, the produced recombinant vector particles infect the remaining cells of the culture. 
Thus, a bloom of recombinant protein expression results over time as recombinant 
vector particles are produced and subsequently infect ail cells in the PCL culture. 

. Similarly, amplification of vector particles with PCL may be used to generate large, 
high titer particle stocks for other applications. In yet another aspect of this invention, 
recombinant protein expression from producer cell lines is described (see Example 7). 
Briefly, cell lines are derived which contain all of the genetic elements, including vector 
replicon and defective helper expression cassettes, from which the production of vector 
particles can be induced, via addition of an extracellular stimulus to the culture. Thus, 
expression of vector-encoded recombinant protein occurs as a result of induction of 
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alphavirus vector particle producer cell lines. In yet a still further aspect of this 
invention, recombinant protein expression from cell lines stably transformed with 
eukaryotic layered vector initiation systems are described (see Example 7). Briefly, cell 
lines are derived which are stably transformed with an inducible eukaryotic layered 
5 vector initiation system cassette that encodes a recombinant protein of interest. Thus, 
expression of vector-encoded recombinant protein occurs as a result of induction of the 
eukaryotic layered vector initiation system cassette. 

As should be readily understood given the disclosure provided herein, 
protein production utilizing RNA vectors replicons, eukaryotic layered vector initiation 
10 systems, or recombinant vector particles may also be accomplished by methods other 
than introduction into packaging or producer cell lines. For example, such vectors may 
be introduced into a wide variety of other eukaryotic host cell lines (e.g., COS, BHK, 
CHO, 293, or HeLa cells), as well as direct administration in vivo or to ex vivo cells, in 
order to produce the desired protein. 

15 



20 



J- Deposit Information 

The following materials have been deposited with the American Type 
Culture Collection: 



Deposit 

Wild type Sindbis virus 
SIN-1 Sindbis virus 



Designation Deposit Date 

CMCC#4639 April 2, 1996 
CMCC #4640 April 2, 1 996 



pBG-SIN 1 ELVS 1 .5 SEAP CMCC #464 1 April 2, 1 996 



Accession 

No. 
VR-2526 
VR-2527 

97502 



The above materials were deposited by Chiron Corporation with the 
American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, 
Maryland under the terms of the Budapest Treaty on the International Recognition of 
25 the Deposit of Microorganisms for purposes of Patent Procedure. The accession 
number is available from the ATCC at telephone number (301) 881-2600. 
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These deposits are provided as convenience to those of skill in the art, 
and are not an admission that a deposit is required under 35 U.S.C. § 1 12. The nucleic 
acid sequence of these deposits, as well as the amino acid sequence of the polypeptides 
encoded thereby, are incorporated herein by reference and should be referred to in the 
5 event of an error in the sequence described therein. A license may be required to make, 
use, or sell the deposited materials, and no such license is granted hereby. 



The following examples are included to more fully illustrate the present 
invention. Additionally, these examples provide preferred embodiments of the 

10 invention and are not meant to limit the scope thereof. Standard methods for many of 
the procedures described in the following examples, or suitable alternative procedures, 
are provided in widely reorganized manuals of molecular biology, such as, for example, 
"Molecular Cloning," Second Edition (Sambrook et aL, Cold Spring Harbor Laboratory 
Press, 1987) and "Current Protocols in Molecular Biology" (Ausubel et al. t eds. Greene 

1 5 Associates/Wiley Interscience, NY, 1 990). 
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EXAMPLES 

EXAMPLE 1 
Isolation and Characterization of Sin 1 

5 

Below, the identification and molecular characterization of a positive 
strand RNA virus which exhibits reduced inhibition of host macromolecular synthesis 
and is capable of establishing persistent infection in vertebrate cells, as compared to 
lytic, cytopathogenic wild type strains of the same virus, is described. For example, 
1 0 Sindbis virus is used as a prototype representative of the Alphavirus genus. 

A * Isolation, p laque purification, and characterization of SIN-1 from a wild-typ e 
Sindbis virus stock 

The isolation, molecular cloning, and characterization of a Sindbis virus 
1 5 variant strain is described. This strain is able to establish productive persistent infection 
in the absence of cytopathicity, but produce levels of virus equivalent to that of 
wild-type virus. 

A high-titered (>10 8 PFU/ml) wild-type stock obtained by infection of 
BHK cells (ATCC No. CCL-10) with Sindbis virus (CMCC #4639) at low MOI (< 0.1). 

20 To facilitate infection, the virus inoculum was contained in a volume just sufficient to 
cover the monolayer when added to the cells. BHK cells were maintained, and all virus 
dilutions were performed, in Eagle minimal essential medium supplemented with 10% 
fetal calf serum. Cells were cultured at 37°C in a 5% C0 2 atmosphere. Extensive CPE, 
- as demonstrated by "rounding up", loss of adhesion, and increased light refraction of 

25 individual cells within the monolayer, and additionally, the decreased overall cell 
density of the monolayer, was observed within 48 hours post infection (hpi). The cell 
culture fluids were collected, cell debris was removed by low speed centrifiigation 
(4,000 rpm for 10 min at room temperature), and the virus stock was aliquoted and 
stored at -70°C. The titer of the Sindbis virus stock was determined by plaque assay as 

30 described previously (Strauss et al., Virology 74:154-168, 1976). Briefly, chicken 
embryo fibroblasts (CEF) monolayers were infected with various dilutions of the virus 
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stock and the monolayer overlayed with media supplemented with 0.75% agarose. At 
24-48 hpi plaques due to cell lysis were visualized and quantitated either directly or, 
alternatively, by staining with crystal violet after removing the agarose overlay. The 
virus titer was determined from samples infected with virus dilutions in which the 
5 plaques were accurately quantitated. 

A Sindbis virus stock enriched for Dl particles was obtained by repeated 
high MOI passage (> 5) on BHK cells. BHK monolayers were infected initially with 
the Sindbis virus seed stock at an MOI =5. The culture medium was collected and 
clarified by low speed centrifugation after complete cell lysis of the infected culture was 

10 observed (usually within 24 hpi). The clarified medium collected from the infected 
culture was then used to infect a fresh BHK monolayer. For example, 2 ml of the virus 
inoculum was added to a fresh BHK monolayer in a 10 cm petri dish. At 1 hpi, 8 ml of 
fresh medium was added to the virus-infected culture. As described above, the culture 
medium was collected and clarified after observation of complete cell lysis of the 

1 5 culture. This process was repeated until the rate at which cytopathogenicity in the BHK 
monolayer developed after infection was delayed until at least 4 days. The delay in the 
onset of cytopathogenicity after infection signifies the presence of a high level of DI 
particles in the virus preparation. 

The presence of a high level of DI particles in the virus preparation 

20 derived from multiple serial undiluted passages of infected cell medium was determined 
by an interference assay or by RNA analysis of BHK infected cells. In the first method, 
a homologous interference assay was performed as a measure of the presence of DI 
particles. Briefly, BHK cells were infected alone at an MOI = 10 with the high-titered 

8 

- ■■ (>10 PFU/ml) wild-type stock prepared as described above At 16 hpi, the virus yield 
25 was determined by plaque assay, as described above. The virus yield from this 

9 10 

experiment was typically 1 x 10 to 1 x 10 PFU/ml. In another experimental group, 
BHK cells were simultaneously coinfected with the wild-type stock (MOI - 10) and the 
virus stock prepared from multiple serial undiluted passages of infected cell medium 
(MOI = 5). As before, the virus yield was determined at 16 hpi by plaque assay. If the 
30 virus stock from the second experimental group contains a high level of DI particles, the 
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virus yield will be at least 2-3 orders of magnitude lower than the first experiment (eg.. 
< 1 x 10 7 PFU/ml). 

As a more definitive method for demonstrating the presence of Dl 
particles in the virus preparation, virus-specific RNA in BHK infected cells at 16 hpi 
5 was analyzed. Briefly, BHK cells were infected (MOI = 10) with the high-titered (>10* 
PFU/ml) wild-type stock or with the virus stock containing DI particles. Mock-infected 
controls and infected cells were treated with dactinomycin (1 mg/ml) and labeled with 
[ 3 H]uridine (20 mCi/ml) from 1 to 16 hpi. RNA was isolated from infected and control 
cells by using RNAzol B, as described by the manufacturer (Tel-Tesu Inc., 

10 Friendswood, Texas). Alternatively, RNA was isolated with Tri-Reagent (Molecular 
Research Center, Inc., Cincinnati, Ohio), or by conventional methods using phenol 
extraction of cells lysed in a buffer (0.05 M Tris, 0.1 M NaCl, 0.001 M EDTA, pH 7.5) 
containing 0.5% Triton and 0.5% recrystallized naphthalene disulfonate, as described 
by Weiss et al. (J. Virol. 1 4:1 189-1 198, 1974). The RNAs were denatured with glyoxal 

15 and electrophoresed through 1.1% horizontal agarose gels prepared in 0.01 M sodium 
phosphate buffer (pH 7.0), at 5V/cm (McMaster and Carmichael, Proc. Nad Acad Sci. 
USA 74:4835-4838, 1977). Alternatively, RNA can be electrophoresed through 
formaldehyde gels. Following electrophoresis, all moisture was removed from the gels 
under vacuum with a gel dryer, and the dried gels were treated for fluorography and 

20 exposed to film. Two RNAs, corresponding to the genomic and subgenomic species 
(42S and 26S, respectively), were observed in samples from BHK cells infected with 
the wild-type virus stock. In contrast, a large number of RNA species that are distinct 
from the standard viral 42S and 26S RNAs were observed in samples from BHK cells 
infected with the virus stock containing DI particles. Multiple RNAs corresponding to 

25 DI RNAs migrated predominantly at molecular weights smaller than the 26S RNA 
species from wild-type virus. An example of multiple RNAs in addition to the 42S and 
26S species observed in BHK cells infected with a virus stock containing DI particles 
may be seen in Figure 3 } lane 3, of Weiss et al. (J. Virol 35:463-474, 1980). 

A Sindbis virus variant strain which is able to establish productive 

30 persistent infection with decreased cytopathogenicity was isolated and molecularly 
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cloned from a virus stock enriched for DI particles. BHK cells were infected at high 
multiplicity (MOI = 5) with the Sindbis virus stock enriched for DI particles. 
Cytopathogenicity developed slowly compared to infection of BHK cells with wild-type 
virus; however, most cells were eventually lysed and detached from the plate. Cell 
5 debris and non-adherent cells were removed every two days by medium changes. 
Within two weeks after initial infection, separate and distinct colonies were observed. 
These colonies were thriving and demonstrated no morphological evidence of CPE, 
compared to uninfected BHK cell controls. Within 3-4 weeks, the cell colonies were 
large and discernible to the naked eye. The colonies were isolated with cloning rings, 

10 and the cells were dispersed with either 3 mM EDTA or trypsin. Dispersed cells from 
each colony were replated without dilution. Thereafter, cells were subcultured at a 1 : 1 0 
dilution upon reaching confluency, generally within four days. Aliquots of cells, 
designated BHK(SIN-l), were prepared in cryotubes after the fifth passage for long 
term storage in liquid nitrogen. BHK(SIN-l) cells were indistinguishable from the 

1 5 original, uninfected BHK cells in terms of growth rate or morphology. 

B. Molecular Cloning of SIN- 1 

To characterize the mutation(s) in the Sindbis genome which correlate 
with the development of the substantially reduced cytopathogenicity of SIN-1, genomic 
20 RNA from SIN-1 virions was isolated, reverse transcribed, and the resultant cDNA 
encompassing the nonstructural protein genes sequenced, as described more fully 
below. 

Briefly, the SIN- 1 virus was plaque purified three times before 
- preparation of a stock that was used for the isolation of RNA. The BHK(SIN-l) cells 
25 were grown as described above, the culture fluid was collected, and various dilutions 
were used to infect primary chicken embryo fibroblast monolayers (CEF, grown in 
minimal essential medium supplemented with 3% fetal calf serum). Following 
infection, medium containing 0.5% noble agar was added to the monolayers. 
Additionally, DEAE-dextran (100 yg/mL) can be included in the agar-overlay medium 
30 to increase the size of the SIN-1 plaques. Individual discreet plaques were observed 
after 3 days of incubation at 30°C in plates infected with suitably dilute inoculums of 
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the BHK(SIN-l) culture fluid. After the third round of purification, the cloned SIN-1 
virus was passaged once at 30°C in CEF cells infected at a MOI = 0.1. The plaque- 
purified SIN-1 vims preparations were determined to be free of DI particles by the 
interference assay and RNA analysis in BHK infected cells, as described above. 
5 BHK cells were infected (MOI = 10) with the plaque purified SIN-1 

stock to determine the ability of this wild-type Sindbis virus variant strain to establish 
persistence. As described above, establishment of persistent infection in BHK cells 
with wild-type Sindbis virus requires the presence of DI particles in the virus 
preparation and considerable time to allow those few surviving cells to grow out. In 

10 contrast, persistent infection was readily established in BHK cells infected at 37°C with 
the SIN-1 variant whether or not DI particles arc present in the virus inoculum. At six 
days post infection with SIN-1, the BHK cells were completely resistant to 
superinfection with wild-type Sindbis virus, demonstrating establishment of a persistent 
infection. However, these cells were susceptible to infection by the heterologous virus, 

15 VSV, demonstrating that interferon is not involved in the establishment of SIN-1 
persistent infection. 

CEF cells were infected (MOI = 10) with the plaque purified SIN-1 stock 
to generate a high titered stock of virus for the isolation of RNA. Ninety ml of culture 
fluid (2xl0 I0 PFU/ml) was clarified by centrifugation for 5 min at 2,500 rpm in a Sorvall 

20 GSA rotor. The SIN- 1 virus was pelleted from the clarified culture fluid by 
centrifugation in an SW27.1 rotor for 2h at 24,000 rpm, at 4°C. The virus pellet was 
then resuspended in 3 ml of culture media, homogenized by repeated pipetting, and 
layered on top of a 25-40% (w/w) sucrose gradient. This was followed by 
* centrifugation in an SW27.1 rotor for 4 h at 24,000 rpm, at 4°C. The virus band was 

25 visualized with incandescent light illumination, and collected with a 22 gauge 
needle/syringe. The RNA was purified further by Proteinase K (Boehringer Mannheim, 
Indianapolis, Indiana) digestion (56°C, 1 hr), followed by extraction with an equal 
Volume of H 2 0-saturated phenokchlorform (1:1 v/v, pH 7.0), followed by precipitation 
with 2 volumes of ethanol and 0.3 M sodium acetate, pH 5.2. 
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Alternatively, the SIN-1 virus can be further purified by polyethylene 
glycol precipitation of infected BHK cell culture media (Strauss et al. Virology 
753:154-168, 1976), or alternatively by pelleting through a sucrose cushion (Polo et al. 
1 Virol. 52:2124-2133, 1988). 
5 Two rounds of first strand cDNA synthesis were performed with the 

purified viral RNA, using the Moloney murine leukemia virus or Superscript II reverse 
transcriptases (Gibco-BRL, Gaithersburg, Maryland) according to the manufacturer's 
recommended conditions. Six separate reactions were performed, using the Sindbis 
virus specific primers complementary to the positive strand (primers denoted by R) 
10 shown below. AH primers denoted by R were phosphorylated at their 5' end with 
polynucleotide kinase (New England Biolabs) prior to the first strand synthesis reaction 
to facilitate cloning. 
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Primer Location Seq. ID No. 

T7/IF Sac 1/T7/1-20 2 



Sequence (5' -> 3') 



Enzyme Site 



I465R 1465-1451 



. GGTGGAGCTCTAATACGACf SaTl 

CACTATAGATTGACGGCGTA 
GTACACAC 

A ATTTCTGCCTCAGC £ co 47 III 



I003F 
2823R 

1003F 
4303 R 



1003-1019 
2823-2806 

1003-1019 
4303-4289 



4 
5 

4 

6 



TATGCAAAGTTACTGAC 
CTGTCATTACTTCATGTC 

TATGCAAAGTTACTGAC 
GCGTGGATCACTTTC 



£co47 III 
BspHl 

Eco41 III 
AvrU 



405 IF 
8II5R 



4051-4069 
8115-8101 



7 
8 



ATTGCGTGATTTCGTCCGT 
TAAATTTGAGCTTTG 



AvrU 
Pml\ 



1680F 1680-1689 
81I5R 8115-8101 



9 
8 



GGCATATGGCATTAGTTG 
TAAATTTGAGCTTTG 



BspH\ 
Pml\ 



8034F 8034-8052 
11,703R XhoVdTJ 

1 1703-11677 



10 
II 



CTGGCCATGGAAGGAAAGG Pml I 

CCCCTCG AGGGT(2 1 )GAAATG Xho I 

TTAAAAACAAAATTTTGTTG 



Synthesis of the second strand from the cDNA template above was 
accomplished in six separate reactions with the Klenow fragment of DNA polymerase 1 
(New England Biolabs, Beverly, MA) according to the manufacturer's recommended 
5 conditions. Sindbis virus specific primers complementary to the negative strand 
^ (primers denoted by F shown above) were used. The double-stranded DNA products 
were substituted, stepwise, for the corresponding regions in the plasmid TotollOl, 
which contains the full-length Sindbis virus genome (Rice et al., J, Virol 67:3809-3819, 
1987). For example, the T7/1F-1465R product was digested with Sac I and Eco 47IH, 
10 and inserted into Sac l/Eco 471II digested and CIAP treated Totol 101 plasmid, which 
was purified away from the Sac l/Eco 47III small fragment (SP6 promoter, Sindbis 
virus nts. 1-1407) by 1% agarose/TAE (50X/Iiter: 242 g Tris base/57.1 ml glacial acetic 
acid/100 ml 0.5 M EDTA pH 8.0) electrophoresis, and GENECLEAN II. This 
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construct was then digested with Eco 47III and Avr II (Sindbis virus nt nos. 1407 and 
4281, respectively), treated with CIAP, and followed by insertion of the 3300 bp 
fragment isolated from the I003F-4303R product, digested with Eco 47 III and Avr II. 
The fully assembled clone is designated as pRSIN-lg (g, as a reference to full-length 
5 genomic clone), and contained all 1 1,703 bp of viral genome. A subset of the primers 
listed in the table above generate redundant double-stranded DNA reaction products 
within the SIN-1 genome. For example, the sequences in the 4051F/81 15R product are 
within the 1680F/8115R product. These redundant products are provided as 
construction alternatives for the SIN-1 genomic clone; i.e., in general, the efficiency of 

1 0 cDN A cloning is inversely proportional to the length of the desired fragment. 

To clone portions of the viral genome not obtained by the above method, 
the SIN-1 RNA viral genome was cloned by reverse transcription polymerase chain 
reaction (RT-PCR). First strand synthesis was accomplished as described above. PCR 
amplifications of Sindbis cDNA with the primer pairs shown above were performed as 

1 5 separate reactions, using the Klentaql enzyme, and the reaction conditions, as described 
in Barnes (Proc. Nad Acad ScL USA 97:2216-2220, 1994). Alternatively, the 
Thermalase thermostable DNA polymerase (Amresco Inc., Solon, OH) was substituted 
for the Klentaq 1 enzyme, using a buffer containing 1.5 mM MgCl 2 that was provided 
by the supplier. Alternatively, the VentR thermostable DNA polymerase (New England 

20 Biolabs, Beverly, MA) was used in the amplification reactions. Additionally, the 
reactions contained 5% DMSO and n HOT START WAX" beads (Perkin-Elmer). The 
PCR amplification protocol used is shown below (The 72°C extension incubation 
period was adjusted to I min per 1 kb of template DNA): 
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Alternatively, cloning of the full-length SIN-1 RNA genome can be 
performed similar to methods which have previously been described (Dubensky et al., 
J. Virol 70:508-519, 1996). Briefly, first strand cDNA synthesis is accomplished with 
5 a mixture of random hexamer primers (50 ng/ml reaction concentration) (Invitrogen, 
San Diego, California), and primer 4B, whose sequence is shown in the table below. 
Genomic length Sindbis vims SIN-1 variant cDNA is then amplified by PCR. Six 
distinct segments using six pairs of overlapping primers is sufficient to clone the entire 
genome. In addition to viral complementary sequences, the SIN-1 5" end forward 

10 primer contains a 19 nucleotide sequence corresponding to the bacterial SP6 RNA 
polymerase promoter and the Apa I restriction endonuclease recognition sequence 
linked to its 5' end. The bacterial SP6 RNA polymerase is poised such that transcription 
in vitro results in the inclusion of only a single non-viral G ribonucleotide linked to the 
A ribonucleotide, which corresponds to the authentic Sindbis virus 5' end. Inclusion of 

15 the Apa I recognition sequence facilitates insertion of the PCR ampiicon into the 
plasmid vector pKS 11+ (Stratagene, La Jolla, California) polylinker sequence. A five 
nucleotide 'buffer sequence' extension is also linked upstream from the Apa\ 
' recognition sequence in order to facilitate efficient enzyme digestion. The sequence of 
the SP6-5' SIN-1 forward primer and all of the primer pairs necessary to amplify the 

20 entire SIN- 1 genome are shown in the table below. (Note that "nt" and "nts" as utilized 
hereinafter refer to "nucleotide" and "nucleotides," respectively). The reference 
sequence (GenBank accession no. J02363, locus: SINCG) is from Strauss et al., 
Virology ] 33:92-1 10, 1984. 
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Primer Location Seq. ID No. Sequence (5'->3*) Recognition 



Sequence 



SP6-IA 


Apa I/SP6/ 
SrN nts.1-18 


. 12 


TATATGGGCCCGATTTAGGTGAC 
ACTATAGATTGACGGCGTAGTAC 
AC 

CTGGCAACCGGTAAGTACGATAC 


Apa I 


IB 


3182-3160 


13 




2A 
2B 


3144-3164 
5905-5885 


14 

15 


ATACTAGCCACGGCCGGTATC 
TCCTCTTTCGACGTGTCGAGC 


Eco RI 


3A 

7349R 


5844-5864 
7349-7328 


16 
17 


ACCTTGGAGCGCAATGTCCTG 
CCI 1 1 1 CAGGGGATCCGCCAC 


£co RJ 
i9amHI 


/ JZor 

3B 


9385-9366 


I o 
19 


O 1 uuCGUATCCCCTGAAAAGG 
TGGGCCGTGTGGTCGTCATG 


5am HI 

Bell 


4A 

I0394R 


9336-9356 
10394-10372 


20 
21 


TGGGTCTTCAACTCACCGGAC 
CAATTCGACGTACGCCTCACTC 


Bell 
Aw WJ 


I0373F 
4B 


10373-10394 
Xba F/T / 

11703-11698 


22 
23 


GAGTGAGGCGTACGTCGAATTG 
TATATTCTAGA(T 35 )GAAATG 


Bsi WI 
Xbal 



PGR amplifications of Sindbis cDNA with the primer pairs shown above 
are performed as separate reactions, using the Thermalase or Vent R DNA polymerases 
(cited above), reaction conditions, and the PCR amplification conditions, as described 
5 above. 

The regions of sequence overlap between the amplification products 
correspond to unique enzyme recognition sites within the PCR amplicon. The PCR 
products are purified (QIAquick PCR purification kit, Qiagen, Chatsworth, California) 
and inserted stepwise into the pKS 11+ vector, between the Apa I and Xba I sites. The 
10 fully assembled clone is designated as pKSRSIN-lg (g, as a reference to full-length 
genomic clone), and contains all 1 1 ,703 bp of viral genome. 

C. Sequence of the SIN-1 phenotvpe 

The SIN-1 specific nucleotide sequences of the pRSIN-lg clone was 
1 5 determined by the dideoxy-chain termination method. Sequence comparison of 8,000 
bp of viral sequence revealed multiple differences between the SIN-1 clone described 
herein and the Sindbis virus (strain HRsp) sequence provided in GenBank (GenBank 



WO 97/38087 



PCT/US97/06010 



100 

Accession no. J02363, locus: SINCG). Differences in the sequence among SIN-1 
(Figure 6), SINCG (Figure 7), and fotol 101 (Figure 8) are presented below. 



nt. Position 


Gene 




SINCG 


Toto 1101 




SIN-1 






nt 


aa 


nt 


aa 


nt 


aa 


45 


5'NTR 


T 




T 





C 




120 


nsPl 


C 


Gin 


C 


Gin 


A 


Lvs 


1775 


nsP2 


G 


null 


G 


null 


A 


null 


1971 


nsP2 


T 


Phe 


* T 


Phe 


C 


Leu 


2992 


nsP2 


C 


Pro 


T 


Leu 


T 


Leu 


3579 


nsP2 


A 


Lys 


G 


Glu 


G 


Glu 


3855 


nsP2 


C 


Pro 


C 


Pro 


T 


Ser 


3866 


nsP2 


C 


null 


C 


null 


T 


null 


4339 


nsP3 


A 


Glu 


A 


Glu 


T 


Val 


4864 


nsP3 


C 


Ser 


C 


Ser 


T 


Phe 


5702 


nsP3 


A 


null 


T 


null 


T 


null 


5854* 


nsP4 


G 


Arg 


G 


Arg 


A 


His 


7612 


junction 


A 




A 




T 




7837 


Capsid 


C 


Arg 


C 


Arg 


T 


Cys 



5 ' This mutation was found in one cDNA clone. It was not detected when the Sin-1 virus 
RNA was sequenced. It likely represents a minor species in the RNA population. 

Verification that the sequence changes were unique to the clone (and not 
the result of cloning artifact) described herein, was determined by amplifying SIN-1 
10 virion RNA by RT-PCR as described above, establishing the sequence containing the 
- nucleotides in question by direct sequencing of the RT-PCR amplicon product, and 
comparing the sequence to the corresponding SIN-1 sequence. 

D - Characteriz ation and genetic mapping of the SIN-1 phenotype with molecular 
15 clones: 

Various regions of the SIN-1 genome were substituted for the 
corresponding wild-type Sindbis virus region in the Totol 101 plasmid (Rice et a!., 1 
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Virol 67:3809-3819, 1987) in order to map the location of the phenotype for 
establishment of persistence. The various SIN-1 nsP genes were substituted into the 
Toto 1101 wild-type Sindbis virus background using restriction enzyme fragments 
purified from pRSfN-lg, as illustrated in the table below. 



nsP Gene 


Restriction Fragment 


Nucleotide 
Coordinates 


Clone Oe^ionatinn 


nsPl 


Pie VEco 47III 


98-1407 


pRSIN-lnsPl 


nsP2 


Eco4im/AvrU 


1407-4281 


pRSIN-lnsP2 


nsP2-N terminus 


Eco AllWBgl II 


1407-2289 


pRSIN-lnsP2-N 


nsP2-C terminus 


Bglll/AvrU 


2289-4281 


pRSIN-lnsP2-C 


nsP3-4 


AvrWEcoM 


4281-5870 


pRSIN-lnsP3 


nsP3 


Avr WSpe I 


4281-5262 


pRSIN-lnsP3 


nsP4 


Spe MAat II 


5263-5870 


pRSIN-lnsP4 


nsPl-4 


Pie MAat II 


98-8000 


pRSIN-lnsPl-4 



The coordinates of the nonstructural gene coding regions are provided in the following 
table: 



10 



nsP Gene Coordinates of Sindbis 

virus genome (nL no.) 

nsPl 60-1680 

nsP2 1680-4101 

nsP3 4101-5769 

nsP4 5769-7597 

nsPl-4 60-7597 

The various SIN-1, Toto, and chimeric SIN-1 /Toto clones, pRSIN-lg. 
Toto, pRSIN-lnsPl, pRSIN-lnsP2, pRSIN-lnsP2-N, pRSIN-lnsP2-C, pRSIN-lnsP3, 
pRSIN-lnsP4, and pRSIN-lnsPl-4 were linearized by digestion with Xho I, which 
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makes a single cut in the cDNA clones immediately adjacent and downstream of a 21 
nucleotide poly dA:dT tract following the Sindbis virus 3' end (viral nt. 1 1703). The 
linearized clones were purified with GENECLEAN II (BIO 101, La Jolia, California), 
and adjusted to a concentration of 0.5 ng/nl. Transcription of the linearized clones was 
5 performed in vitro at 40°C for 90 minutes according to the following reaction 
conditions: 2 ^1 DNA/4 .25 nl H 2 0); 10 nl 2.5 mM NTPs (UTP, ATP, GTP, CTP); 
1.25 |il 20 mM Me7G(5 , )ppp(5 , )G cap analogue; 1.25 nl 100 mM DTT; 5 jil 5X 
transcription buffer (Promega, Madison Wisconsin); 0.5 pi RNasin (Promega); 0.25 nl 
lOpg/pl bovine serum albumin; and 0.5 \il T7 RNA polymerase (Promega). The 

10 in vitro transcription reaction products were digested with DNase I (Promega), purified 
by sequential phenol:CHCl 3 and ether extraction, and followed by ethanol precipitation 
Alternatively, the in vitro transcription reaction products can be used directly for 
transfection. The in vitro transcription reaction products or purified RNA were 
complexed with a commercial cationic lipid compound (LIPOFECTIN, GIBCO-BRL. 

1 5 Gaithersburg, MD) and applied to Baby Hamster Kidney-2 1 (BHK-2 1 ) cells maintained 
in a 60 mm petri dish at 75% confluency. Alternatively, BHK cells were electroporated 
with the in vitro transcription reaction products or purified RNA, exactly as described 
previously (Liljestrom, Bio/T echnology 9:1356-1361, 1991). The transfected cells were 
incubated at 37°C. At 48 hours post-transfection, culture media were collected and the 

20 titer of each virus was determined by plaque assay, as described above. The titered 
virus stocks derived from these in vitro transcription reactions were designated as 
shown in the table below. 
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Clone Designation 


Virus Designation 


pRSIN-lnsPl 


SIN-lnsPl 


pRSIN-lnsP2 


SIN-lnsP2 


pRSIN-lnsP2-N 


SIN-lnsP2-N 


pRSIN-lnsP2-C 


SIN-lnsP2-C 


pRSIN-lNSp3-4 


SIN-lnsP3-4 


pRSIN-lnsP3 


SIN-lnsP3 


pRSIN-lnsP4 


SlN-lnsP4 


pRSIN-lnsPl-4 


SIN-lnsPl-4 


TotollOi 


Toto 



To map the SIN-1 persistent phenotype, 8 x 10 S BHK cells were infected 
(MOI=5) with each of the virus stocks prepared above. At 3 days post infection, the 
culture viability was determined by trypan blue dye exclusion. The results of this 
5 experiment (shown below), demonstrate that the SIN-1 phenotype of establishing 
persistent non-cytocidal infections maps to the nonstructural genes, and to nsP2 gene'in 
particular. The number of cells in the mock-infected culture represents continued 
growth of these cells until they reached the stationary phase. At 3 dpi, cells infected 
with SIN-lnsPl, SIN-lnsP3, SIN-l-nsP3-4 and SIN-lnsP4 had all died. The cells that 
10 survived infection with SIN-lnsP2 and SIN-lnsPl-4 continued to grow and were 
persistently infected based on staining with antibodies specific for Sindbis virus. 
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v «us NumberofCellsat3dpi 

SIN-lnsPl : o 

SIN-lnsP2 3xl0 5 

SIN-lnsP3-4 o 

SIN-lnsP3 o 

SIN-lnsP4 o 

SIN-lnsPl-4 5xI0 ' 

Toto o 

Mock lxl0 ^ 



As shown above, the observed SIN-1 phenotype of establishing non- 
cytocidal persistent infections maps to the viral nsPs, as opposed to the sPs. This 
conclusion was demonstrated clearly by comparison of cell survival levels between 
5 cultures infected with the Toto or SIN-lnsPI-4 virus stocks. Both the Toto and SIN-1 
nsPl-4 viruses contain the wild-type sPs; cell survival was observed, however, only in 
those cultures infected with the virus (SIN-lnsPI-4 ) containing nsPs derived from the 
SIN-1 clone. In these experiments, cell survival was not dependent upon the source of 
the Sindbis virus sPs. Importantly, the SIN-1 phenotype was mapped further to nsP2. 

10 The level of cell survival was comparable between cultures infected with the SIN- 
lnsPl-4 or SIN-lnsP2 viruses. Further, a C -> T transition at nucleotide 3855, in the 
SIN-1 nsP2 gene is responsible for the characteristic phenotype of establishment of 
persistent infection in cells infected with the SIN-1 virus. The single proline to serine 
change in the nsP2 protein produced in cells infected with the chimeric virus SIN- 

15 lnsP2-C, was all that was required to convert wild-type Sindbis virus (Toto 1101) from 
a virus that killed all of the infected cells into a virus which permitted many of the 
infected cells to survive and continue to produce virus. The phenotypes of chimeric 
viruses derived from insertion of the SIN-1 nsPl, nsP3, or nsP4 genes into the Toto 
background were indistinguishable from wild-type and complete lysis was observed in 

20 cultures infected with these viruses. 
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The possible effect of amino acid changes in the SrN-1 nsPs on the level 
of productive infection in BHK cells was determined by comparing the virus yield over 
a time course in BHK cells inoculated with the various SIN-1 , wild-type (Toto), or SIN- 
1/Toto chimeric strains. Briefly, BHK cells were infected (MOI=20) with S1N-1 
5 (plaque purified stock described above), Toto, SIN-lnsPl-4, or SIN-lnsP2 viruses, and 
the culture fluids were collected at 3, 6, 9, and 12 hours post infection. The titers of 
virus in the culture fluids were then determined by plaque assay, as described above. 
The results of this study, shown in Figure 2, demonstrate that equivalent levels of virus 
were produced in BHK cells infected with wild-type or SIN-1 strains. The actual virus 
10 titers at the 12 hpi time point are set forth in the table below. More than half of the 
BHK cells survived infection with SIN-1 virus (and chimeric viruses containing SIN-1 
nsPsl-4, or SIN-1 nsP2) in combination with levels of virus production equivalent to 
wild-type strains. 



15 



Virus Titer (PFU/ml) at 1 2 hpi (x 1 o') 
SIN-1 T6 
Toto 2.7 
SIN-lnsPl-4 1.8 
SIN-lnsP2 2.6 



The possible effect of amino acid changes in the SIN-1 nsPs on the level 
of viral-specific RNA synthesis was determined by comparing the level of [ 3 H]-uridine 
incorporation over a time course in BHK cells inoculated with the various SIN-1, wild- 
type (Toto), or SIN-l/Toto chimeric strains. BHK cells (3 x 10* cells/35 mm dish) were 

20 grown at 37°C, according to the conditions described herein. The cells were infected 
(MOI = 20) with SIN-1, TotollOl, SIN-lnsPl-4, or SIN-lnsP2 viruses. At 30 min. 
post infection, the culture medium was adjusted to 1 ng/ml actinomycin D. After 
incubation for an additional 30 min, the culture medium was adjusted to 10 jiCi/ml 
[ 3 H]-uridine. At 3, 6, 9, and 12 hpi, the treated cells were washed with PBS, and lysed 

25 by addition of 200 \xl of TTE buffer (0.2% Triton X-100, 10 mM Tris-HCl, pH 8.0, 1 
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mM EDTA). The RNA was precipitated at 4°C by addition of 200 ul of a 25% 
trichloroacetic acid (TCA) solution. The RNA was pelleted by microcentrifugation for 
5 min at 14,000 rpm, rinsed once with 5% TCA, and dissolved in a solution consisting 
of 50 mM NaOH/0.1 % SDS, at 55°C. The solution containing the dissolved RNA was 
5 transferred into scintillation vials and the level of incorporated [ 3 H]-uridine was 
determined using EcoLume (ICN, Irvine, CA) scintillation fluid. The results of this 
study, shown in Figure 3, demonstrate that levels of virus-specific RNA were 
dramatically lower in BHK cells infected with SIN- 1 compared to wild-type virus. This 
phenotype of low level of virus-specific RNA synthesis maps to the nsPs, as shown by 
10 the equivalents low levels of RNA produced in BHK cells infected with the SIN-1 or 
SIN-lnsPl-4 strains, compared to wild-type. 



[ } H]uridine incorporation (xlO* cpm) 



SIN-1 22 

TotollOl 86 

SIN-lnsPl-4 28 

SIN-lnsP2 62 

Mock g 



Virus-specific RNA synthesis in infected BHK cells was also determined 
15 using all of the SIN-1, Toto, and SIN-l/Toto chimeric, strains. The levels of 
pHluridine incorporation, relative to wild-type infection (Toto) at 9 hpi, are shown in 
Figure 4, and given in the table below. 
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Virus 


No. of Experiments 


Relative RNA 
Synthesis Level 


Standard D< 


TotollOl 


9 


1.0 




SIN-1 


9 


0.1 


+0.1 


SIN-lnsPl-4 


8 


0.2 


±0.1 


SIN-lnsPl 


7 


1.0 


±0.2 


SIN-lnsP2 


S 


0.6 


+0.1 


SIN-lnsP3 


4 


1.0 


±0.0 


SIN-lnsP3-4 


7 


0.4 


±0.1 


SIN-lnsP4 


6 


0.8 


±0.1 


SIN-lnsP2-N 


1 


0.9 




SIN-lnsP2-C 


3 


0.6 


±0.2 


Mock 


9 


0.0 





Thus, BHK cells survive infection with SIN-1 virus (and chimeric viruses containing 
SIN-1 nsPsl-4, or SIN-1 nsP2), SIN-1 virus levels equivalent to wild-type strains are 
produced in BHK cells, and, the level of viral-specific RNA synthesized is 10-fold 
5 lower compared to wild-type virus. 

The possible effect of amino acid changes in the SIN-1 nsPs on the level 
of inhibition of host cell protein synthesis was determined by comparing the level of 
protein synthesis, relative to uninfected cells, over a time course in BHK cells 
inoculated with the various SIN-1, wild-type (Toto), or SIN-l/Toto chimeric strains. 

10 Briefly, 35 mm dishes seeded with 2 x 10 $ BHK cells were infected (MOI = 20) with 
the various Sindbis virus strains in a 0.5 ml virus inoculum in a buffer consisting of 
PBS/1% fetal calf serum. The plates were incubated at 4°C for 1 hr with continuous 
gentle shaking. The inoculum was replaced with 2 ml of medium, described previously, 
containing 10% fetal calf serum, and the dishes were placed in a CO, incubator at 37°C. 

15 At 5, 8, and 1 1 hr later, the media was replaced with 2 ml of MEM lacking methionine 
(Met-), with 2% fetal calf serum, and incubated 30 min. The medium was then replaced 
with 1 ml of MEM (Met-) containing 2% fetal calf serum and 10 \xC\lm\ of 
[ 35 S]methionine. Following a 30 min incubation period at 37°C, 1 ml of medium 
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containing 10% fetal calf serum was added to each well, and the dishes were incubated 
for another 30 min at 37°C. This dilution is sufficient to inhibit further incorporation of 
radioactive label into protein and significantly decreases the background of free [ 35 S]- 
methionine detected in polyacrylamide gels. The medium was then removed from the 
5 well. Cells were washed three times with PBS, scraped from the dish into PBS, pelleted 
by centrifugation, and dissolved in 25 nl of loading buffer (0.06 M Tris-HCl, pH 6.7, 
2% SDS ? 5% P-mercaptoethanol. 5% glycerol, 0.05% bromophenol blue). One-fifth of 
the sample was analyzed on the gel. After electrophoresis, the gels were stained with 
Coomassie brilliant blue R, dried, and autoradiographed. The rates of inhibition of host 

10 cell protein synthesis were compared by quantitating the amount of radioactivity in the 
section of the gel containing only host proteins. The results of this study are shown in 
Figure 5, and demonstrate that the level of inhibition of host cell protein synthesis is 
significantly lower in SIN-1 virus infected cells, compared to wild-type virus infected 
cells, particularly at the earlier 6 and 9 hour time points post infection. 

15 In summary, BHK cells survive SIN-1 infection, virus levels equivalent 

to wild-type strains are produced in BHK cells, the level of viral-specific RNA 
synthesized is 10-fold lower than wild-type virus, and the level of inhibition of host cell 
protein synthesis in SIN-1 virus infected cells is significantly lower compared to wild- 
type virus infected cells. The phenotypes of the SIN-1 virus described herein map to 

20 the viral nsP genes. 



EXAMPLE 2 

Isolation and Characterization of Positive Strand RNA Viruses Which 
25 Exhibit Reduced Inhibition of Host Macromolecular Synthesis 

The derivation of virus variants exhibiting the desired phenotypes of 
reduced, delayed or no inhibition of host cell macromolecular synthesis is dependent on 
the generation, characterization, and isolation of sequences which differ from that of 
30 wild-type virus. However, with the exception of example 1, there are no obvious or 
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previously disclosed methods to select for or identify coding or non-coding viral 
sequence changes that result in alteration of this virus-based inhibition of 
macromolecular synthesis, or the generation of viruses that lead to persistent, rather 
than lytic, infection. This example provides specific methods and that enable one to 
5 overcome these obstacles. 

A. Biological Selection of Virus Variants 

The biological derivation of virus variants which result in reduced, 
delayed, or no inhibition of host macromolecular synthesis, or which establish persistent 

10 infections, can be performed by allowing for natural, spontaneous mutation within a 
cell, or first subjecting the desired virus stock to physical, chemical or other artificial 
mutagenesis, followed by infection of susceptible cells, and successive enrichments for 
those cell populations which harbor mutated virus. It is possible that prior mutagenesis, 
although not required, will facilitate the generation of appropriate mutations. The 

15 selection is based on the ability of cells infected with the desired variant to survive for 
significantly longer periods than wild-type virus infected cells. The following example 
provides methods in detail, using Sindbis virus as an example; however, other viruses 
are readily substituted, as noted in the detailed description. 

Specifically, in the case of chemical mutagenesis, a Sindbis virus stock 

20 suspension with a titer of greater than or equal to 10 9 pfii/ml is treated with 
nitrosoguanidine at a final concentration of 100 ug/ml. After 15 minutes at room 
temperature, the nitrosoguanidine is removed by dialysis at 4°C and the mutagenized 
stock is subsequently used for infection. Approximately 5xl0 6 cells of the desired type 
(for example BHK-21), grown in flat stock culture, are infected with the mutagenized 

25 virus at a multiplicity of infection (M.O.I.) of approximately 5, to ensure that every cell 
is infected. At 12, 24, 36, and 48 hours post-infection, the cell monolayer is washed 
twice with fresh media to remove dead cells, and replaced with media consisting of a 
mixture of 50% fresh media and 50% conditioned media. After a desired time post- 
infection (for example 72 hours), the remaining cells are gently trypsinized to detach 

30 them from the culture dish and strip away any cell-associated extracellular virus, which 
is then separated from the cells by differential centrifugation. The remaining cells are 
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then re-seeded directly into a tissue culture dish containing a semi-confluent uninfected 
cell monolayer, at a ratio of 1 infected cell for every 10 3 uninfected cells. Additional 
rounds of selection are performed by seeding onto uninfected cells for amplification, 
followed by the above washing and harvesting steps. Alternatively, the initial 
5 infections may be done using a wild-type virus stock at low M.O.I., allowing for 
spontaneous mutation during replication within the cell. Using this approach, a 
heterogenous population of mutant virus is produced by the infected cells. In those 
instances where the population of infected cells recovered after trypsinization includes 
a significant number of non-viable or severely damaged cells, a brief treatment (5 

10 minutes at room temperature) with 0.75% NH 4 C1 in 1 7 mM Tris (pH 7.65 with HC1), or 
centrifugation through Percoll™, is included to remove remaining dead or damaged 
cells, prior to re-seeding onto uninfected cell monolayers. After a minimum of two 
successive rounds of selection, virus variants displaying the desired phenotype are 
isolated by limiting dilution or plaque purification, and subjected to cDNA cloning as 

15 described in Example 1. Isolation and characterization of specific sequences 
responsible for the variant phenotype are accomplished by substitution of defined 
regions of cDNA into a genomic clone or expression vector and testing for the 
accompanying phenotypic change, as outlined in the Examples. 

20 B. Genetic Selection of Virus Variants 

In a related approach, natural mutation or random mutagenesis is 
performed, not on a virus stock, but rather, using cloned genomic cDNA of the virus 
that can be transcribed into infectious viral RNA in vitro or in vivo. For example, in the 
.case of prior mutagenesis, plasmid pRSINg, which contains a full-length genomic 

25 Sindbis cDNA functionally linked to a bacteriophage SP6 promoter (Dubensky, et al., 
J. Virol. 70:508-519, 1996), is transformed into competent E. coli XLl-Red mutator 
cells and plated on ampicillin plates to obtain colonies. At least 200 colonies are 
chosen at random, pooled, and inoculated for overnight growth in a 10 ml broth culture 
containing ampicillin. Plasmid DNA is prepared from the culture to obtain a 

30 heterogeneous population of pRSINg harboring various mutations. The DNA is 
linearized with Xba I and transcribed in vitro using SP6 polymerase, as described 
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previously (Rice et al., 1 Virol 67:3809-3819, 1987). RNA transcripts are 
subsequently transfected into the desired cell type (for example, BHK cells) by 
electroporation (Liljestrom and Garoff, Bio/Technology 9:1356-1361, 1991), for 
initiation of the Sindbis virus infection cycle. Alternatively, in vitro transcribed RNA 
5 from an unmutagenized template may also be transfected. Selection of virus mutants 
which establish a persistent infection or exhibit reduced, delayed, or no inhibition of 
host macromolecular synthesis is performed as above. The selected virus variants with 
the desired phenotype are isolated by limiting dilution or plaque purification, subjected 
to cDNA cloning, and the sequences responsible for the variant phenotype are isolated 
1 0 and characterized, as described previously. 

C Genetic Selection of Variants Using Virus-Derived Vectors 
1. Vectors Expressing an Immunogenic Protein 

In another approach, spontaneous intracellular mutation, or random 

1 5 mutagenesis is performed on virus-derived sequences of a viral-based expression vector. 
These sequences include non-coding and regulatory regions, as well as nonstructural 
protein encoding regions. In certain instances, structural protein-encoding sequences 
also may be included. Such random mutagenesis or spontaneous intracellular mutation 
may be performed using any of the techniques described in this invention, along with 

20 the cloned cDNA of a virus-derived vector which can be transcribed into RNA in vitro 
or in vivo. For example, a replication-competent Sindbis virus expression vector can be 
used to express an immunogenic cell surface protein or other peptide which may be 
bound by specific antibodies added to the infected cells. Cells which contain functional 
vector are identified by their expression of the vector-encoded heterologous antigen and 

25 ability to be bound by antibody specific for the encoded antigen. By limiting the 
selection process to cells surviving for extended periods (see above), only those 
harboring vector variants exhibiting the desired phenotype are enriched. 

Specifically, plasmid pTE3'2J (Hahn etal, J. Virol. 59:2679-2683, 
1992), comprising an SP6 promoter operably linked to a full-length genomic Sindbis 

30 cDNA with a duplicated subgenomic promoter for expression of heterologous genes, is 
subjected to random mutagenesis, as described above. This process results in isolation 
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of a population of heterogeneous plasmid containing the random mutations. In parallel, 
the desired heterologous cell surface protein or marker peptide gene is cloned into a 
shuttle vector for insertion into the mutagenized pTE3'2J vector. Preferred cell surface 
proteins for use as markers include human B7.1 (Freeman et al., J. Immunol. J 43:27 \4- 
5 2722, 1989) and the murine H-2K" class I molecule (Song et al.. J. Biol. Chem. 
269:7024-7029, 1994). The human B7.1 gene is amplified by standard three-cycle 
PCR, with 1.5 minute extension, from a pCDM8 vector containing the full-length 
cDNA sequence (Freeman et al., ibid), using the following oligonucleotide primers that 
are designed to contain flanking Xba I and Bam HI sites. 

10 

Forward pri mer: hB7.1 FX f5'-rest. site/B7.1 sequenced rSEO ID. NO. 24) 
5-ATATATCTAGA/GCCATGGGCCACACACGGAGGCAG-3' 

Reverse pri mer: hB7.1 RB (5'-rest. site/B7.1 seq uence) fSFQ ID. NO. 25) 
1 5 5'-ATATAGGATCC/CTGTTATACAGGGCGTACACTTTC-3' 

Following amplification, the approximately 875 bp DNA fragment is 
purified using a QIAquick-spin PCR purification kit (Qiagen, Chatsworth, CA), 
digested with Xba I and Bam HI and ligated into Sindbis shuttle vector pH3'2Jl (Hahn 

20 et al., ibid) that also has been digested with Xba I and Bam HI and treated with calf 
intestinal alkaline phosphatase, to create the construct pH3'B7. 1 . 

Following random mutagenesis of the pTE3'2J double subgenomic 
vector, as described above, plasmid pH3'B7.1 and the mutated population of plasmid 
pTE3'2J are digested with Apa I and Xho I, purified from 0.7% agarose gels using 

25 GENECLEAN II II™ (Biol 01, Vista, CA), and ligated to form a heterogeneous 
population of a B7.1 expression vector, designated pTE3'B7.1. Without transforming 
£. coli and isolating individual clones, the entire population of ligated vector is 
linearized with Xho I and used as template for in vitro SP6 transcription reactions, as 
described above. The heterogeneous population of randomly mutagenized B7.1 vector 

30 transcripts is then electroporated into the desired cell type (for example, BHK cells) for 
initiation of the Sindbis replication cycle. Selection for virus mutants which establish a 
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persistent infection or exhibit reduced, delayed, or no inhibition of host macromolecular 
synthesis is performed using a monoclonal antibody specific for B7.1 (Pharmingen, San 
Diego, CA) and either magnetic- or fluorescence-activated cell sorting protocols. The 
preferred secondary antibody tags include rat-anti-mouse IgG conjugated with magnetic 
5 microbeads for magnetic cell sorting (miniMACS Magnetic Separation System, 
Miltenyi Biotec, Auburn, CA; Miltenyi et ah, Cytometry 77:231-238, 1990), and FITC- 
conjugated rat anti-mouse IgG (Pharmingen, San Diego, CA) for fluorescence activated 
cell sorting. Using such an approach and harvesting cells after an extended period {see 
above), only viable cells which contain a functional virus-derived vector (as evidenced 

10 by B7. 1 expression), displaying the desired phenotype, are enriched. 

Specifically, the heterogeneous population of randomly mutagenized 
B7.1 vector transcripts is electroporated into 5xl0 7 cells, according to the procedure of 
Liljestrom and Garoff (1991, ibid), and plated as a flat stock culture. At 12, 24, 36, and 
48 hour post-infection, the cell monolayer is washed twice with fresh media to remove 

15 dead cells, and replaced with media consisting of a mixture of 50% fresh media and 
50% conditioned media. After a desired time post-infection (for example 72 hours), *he 
remaining cells are gently trypsinized to detach them from the culture dish, and pelleted 
by centrifugation at 1000 rpm, 4°C. The cells are resuspended in 2 ml of blocking 
solution (PBS + 10% fetal calf serum + 1 % BSA), incubated on ice for 1 0 minutes, and 

20 re-pelleted. Next, the cells are resuspended in 200 ul of the primary anti-B7.1 antibody 
solution (diluted in PBS + 0.5% BSA), and incubated on ice for 30 minutes. The cells 
are washed twice with PBS + 0.5% BSA, pelleted, and resuspended in 200 ul of 
magnetic bead solution (200 ul washed magnetic rat anti-mouse coated beads in PBS + 
. 0.5% BSA + 5 mM EDTA). Following incubation at 4°C for 30 minutes, the bead- 

25 bound cells are washed twice with PBS + 0.5% BSA + 5 mM EDTA, and resuspended 
in 1 ml of the same buffer. The bead-bound cells are then purified using the MiniMacs 
magnet column, according to the manufacturer's directions. The eluted positive cells 
are then re-seeded directly into a tissue culture dish containing a semi-confluent 
uninfected cell monolayer, at a ratio of 1 infected cell for every 10 4 uninfected cells. 

30 Additional rounds of selection are performed as above for amplification/enrichment. 
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The selected vector variants with the desired phenotype are isolated by limiting dilution 
or plaque purification, subjected to cDNA cloning, and the sequences responsible for 
the variant phenotype are isolated and characterized, as described previously. 

Alternatively, the B7.1 expression vector, pTE3'B7.1, may be used 
5 directly for in vitro transcription without prior mutagenesis. Following transfection of 
these RNA transcripts, selection for mutants of the desired phenotype is performed as 
described above. 

2. Vectors Expressing a Selectable Marker 

1 0 Alternatively, an antibiotic resistance marker can be used for selection of 

virus vector variants exhibiting the desired phenotype. For example, the Sindbis vector 
pRSIN-luc (Dubensky et a/., ibid) is subjected to random mutagenesis as described 
above and a population of heterogeneous plasmid containing the random mutations is 
isolated. In addition, the gene encoding neomycin (G418) resistance is isolated by 

15 standard three-cycle PCR amplification, with 1.5 minutes extension, from plasmid 
pcDNA3 (Invitrogen, San Diego, CA), using the following oligonucleotide primers that 
are designed to contain flanking Xho I and Not I restriction sites: 

Forward pri mer: NeoFX (5'-rest. site/neo sequence! (SEQ. ID. NO. 26) 
20 S^ATATACTCGAG/ACCAUGATTGAACAAGATGGATTG-S' 

Reverse pri mer: NeoRN (5'-rest. site/neo sequence! (SEQ. ID. NO. 27) 
5 f -TATATAGCGGCCGC>TCAGAAGAACTCGTCAAGAAG-3 , 

25 Following amplification, the DNA fragment is purified with QlAquick- 

spin, digested with Xho 1 and Not I, and Iigated into the mutated population of pRSIN- 
luc vector that also has been digested with Xho I and Not I, treated with calf intestinal 
alkaline phosphatase, and purified from a 0.7% agarose gel, away from its previous 
luciferase insert, using GENECLEAN II. Ligation of the neomycin resistance marker 

30 into the mutated pRSFN vector creates a heterogeneous population of a neo r expression 
vectors, designated pRSIN-neo. Without transforming E. coli and isolating individual 
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clones, the entire population of ligated vector is linearized with Sac I and used as 
template for in vitro SP6 transcription reactions, as described above. The 
heterogeneous population of randomly mutagenized neo r vector transcripts is 
electroporated into the desired cell type (for example, BHK cells) for initiation of the 
5 Sindbis replication cycle and heterologous gene expression. Approximately 12-24 hour 
post-transfection, the cells are trypsinized and replated in media containing G418. 
Subsequently, the media is changed at approximately 24 hour intervals to remove dead 
cells, and replaced with G418-containing media consisting of a mixture of 50% fresh 
media and 50% conditioned media. Media changes are reduced after the majority of 

10 dead cells are washed away, and cell foci begin to form. Using this selection, only 
viable cells which contain a functional virus-derived vector, exhibiting the desired 
phenotype (as evidenced by neomycin resistance), are enriched. Mutant vector variant 
genomes displaying the desired phenotype are isolated by harvesting RNA directly from 
cell lysates using RNAzol B (Tel-Test, Friendswood, TX). Alternatively, the cells are 

15 transfected with a defective-helper Sindbis RNA (pR-dlnsPSINL Dubensky et aL ibid), 
which results in the production of packaged pRSIN-neo vector-containing particles* in 
the culture supernatant, which may be concentrated by centrifugation for subsequent 
vector RNA isolation, as above. In each case, the vector RNA is subjected to cDNA 
cloning, and the sequences responsible for the variant phenotype are isolated and 

20 characterized, as described previously. 

Alternatively, the pRSIN vector with a neomycin resistance marker, may 
be used directly for in vitro transcription without prior mutagenesis. Following 
transfection of these RNA transcripts, selection for mutants of the desired phenotype is 
^ performed as described above. 

25 In another example, a Semliki Forest virus derived vector pSFV-1 

(GIBCO/BRL) is used for insertion of the antibiotic resistance marker and subsequent 
selection of the desired phenotype. The gene encoding neomycin (G418) resistance is 
isolated by standard three-cycle PCR amplification, with 1.5 minutes extension, from 
plasmid pcDNA3 (Invitrogen, San Diego, CA), using the following oligonucleotide 

30 primers that are designed to contain flanking BamH 1 restriction sites: 
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Forward primer: 5'BAMHI-Neo (SEQ. ID. NO. 1 1 8) 

5 '-ATATAGGATCCTTCGCATGATTGAACAAGATGGATTGC-3 ' 

5 

Reverse primer: 3'BAMHI-Neo (SEQ. ID. NO. 57) 

5 * - AT ATAGG ATCCTC AG A AG A ACTCGTC A AG A AGGCG A-3 ' 

10 Following amplification, the DNA fragment is purified with QIAquick- 

spin, digested with BamH I, and ligated into wild-type or a previously mutated 
population of pSFV-1 vector that also has been digested with BamH I, treated with calf 
intestinal alkaline phosphatase, and purified from a 0.7% agarose gel, using 
GENECLEAN II. Transcription of RNA vectors and selection for mutants of the 

1 5 desired phenotype are carried out essentially as described above for Sindbis virus. 



EXAMPLE 3 

Preparation of SIN 1 -Based RNA vector replicons 

20 

A. Construction of the SIN-1 Basic Vector 

SIN-1 derived vector backbones were constructed and inserted into a 
plasmid DNA containing a bacteriophage RNA polymerase promoter, such that 
transcription in vitro produced an RNA molecule that acts as a self-replicating molecule 

25 (replicon) upon introduction into susceptible cells. The basic SIN-1 RNA vector 
replicon was comprised of the following ordered elements: SIN-1 nsPs genes, 
subgenomic RNA promoter region, a polylinker sequence, which may contain 
heterologous sequence insertions, the SIN-1 3' non translated region (NTR), and a poly 
adenylate sequence. Following transfection into susceptible cells, autonomous 

30 replication of the RNA vector replicon occurs as for virus, and the heterologous 
sequences are synthesized as highly abundant subgenomic mRNA molecules, which in 
turn serve as the translational template for the heterologous gene product. 



WO 97/38087 PCT/US97/06010 

117 

The 5' region of the vector, comprised of the SIN-1 nsP genes and 
subgenomic promoter, extends to within two nucleotides of the capsid gene 
translational initiation point. This region was first inserted into the pKSII+ plasmid 
(Stratagene) between the Apa I and Xho I sites. The 5' region of the vector was 
5 amplified by PCR from the pRSIN-lg plasmid in two overlapping fragments. The first 
fragment was generated in a PCR reaction with the following primer pair: 

Forward primer: S?6A¥(Aoa I site/SP6 promoter/SIN nts 1-1 8) : (SEQ. ID NO. 12) 
5'- 

10 TATATGGGCCCGATTTAGGTGACACTATAGATTGACGGCGTAGTACAC 

Reverse primer: SIN5160R (SIN nts 5 160-5 140) : (SEQ. ID NO. 28) 
S'-CTGTAGATGGTGACGGTGTCG 

1 5 The second fragment was generated in a PCR reaction with the following primer pair: 

Forward primer: 5079F (SIN nts 5079-51001 : (SEQ. ID NO. 29) 
5-GAAGTGCCAGAACAGCCTACCG 

20 Reverse primer: SIN7643R (buffer sequence/Xfto 1 site/ SIN nts 7643-762 IV 
(SEQ. ID NO. 30) 

5'-TATATCTCGAGGGTGGTGTTGTAGTATTAGTCAG 



The two PCR reactions were performed with the primer pairs shown 
25 above using the Thermalase (Amresco, Solon, OH), Vent (New England Biolabs, 
Beverly, MA) or KlenTaq thermostable DNA polymerases. Additionally, the reactions 
contained 5% DMSO and "HOT START WAX" beads (Perkin-EImer, Foster City, 
. CA). The PCR amplification protocol shown below was used. The extension period 
was 5 minutes or 2.5 minutes for reactions with the SP6-1F/SIN5160R or 
30 5079F/SIN7643R primer pairs, respectively. 
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Temperature (°C) Time(Min) No. Cycles 

95 2 i 

95 05 " 

55 0.5 35 

72 5.0 or 2.5 

72 To - ]0 



Following PCR, the two amplified products of 5142 bp (SP6- 
1F/SIN5160R primer pair) and 2532 bp (5079F/SIN7643R primer pair) were purified 
5 (PCR purification kit, Qiagen, Chatsworth, CA), and digested with Apa I and Sfi I (5 142 
bp amplicon product) or Sfi 1 and Xho I (2532 bp amplicon product). The digested 
products were purified with GENECLEAN II (Bio 101, Vista, CA) and ligated together 
with pKS+ plasmid (Stratagene, La Jolla, CA) prepared by digestion with Apa I and 
Xho I, and phosphatased with CIAP. This construction is known as pKSSIN-I-BV5'. . 
1 0 The 3' region of the vector, comprised of the viral 3' end, a polyadenylate 

tract, and a unique restriction recognition sequence were inserted between the Not I and 
Sac I sites of the plasmid pKSSIN-l-BV5'. The 3' region of the vector was amplified by 
PCR from the pRSIN-1 g plasmid in a reaction containing the following primer pair: 

15 Forward pri mer: SIN1 1386F (buffer seauence/Afor I site/SIN nts 1 1386-1 1407) : 
(SEQ. ID NO. 31) 

5'-TATATATATATGCGGCCGCCGCTACGCCCCAATGATCCGAC 
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Reverse primer: SIN1 1703R (buffer seauence/Sac 1 and Pme 1 sites/T40/SIN nts 1 1 703- 
11698) : (SEQ. ID NO. 32) 

5'-CTATAGAGCT CGTTTAAACT H11IMT1T TTTTTTTTTT 
TTTTTTTTTT TTTTTTTTTG AAATG 

5 

In addition to the primer pairs shown above, the PCR reaction contained 
Thermalase, Vent or KlenTaq thermostable DNA polymerase, 5% DMSO, and "Hot 
Start Wax" beads (Perkin-Elmer). The amplification protocol was as shown above, but 
with a 72°C extension period of 30 seconds. 

10 The 377 bp amplified product corresponding to the 3' vector end was 

purified, digested with Not I and Sac I, purified, and ligated into pKSSIN-l-BV5\ 
which was prepared by digestion with Not I and Sac I and treatment with CIAP. This 
plasmid is known as pKSSIN- 1 -BV. 

Using techniques described above, the lacZ gene encoding the p- 

15 galactosidase reporter protein was liberated from the plasmid pSV-p-galactosidase 
(Promega Corp., Madison, WI) with Bam HI and Hind III, and inserted into pKS+ af the 
corresponding enzyme recognition sites. The lacZ gene was digested from this plasmid, 
pKS-p-gal, with Xho I and Not I, and inserted into pKSSIN-l-BV, between the Xho I 
and Not I sites. This plasmid is known as SINrep/SIN-1 nsPl-4/lacZ. 

20 Alternatively, the firefly luciferase gene encoding the luciferase reporter 

protein was liberated from the plasmid pT3/T7-LUC (Clontech, Palo Alto, CA) by 
digestion with Hind III, and inserted into pKS+ (Stratagene, La Jolla, CA) at the 
corresponding enzyme recognition sites contained in the multiple cloning sequence to 
generate pKS-luc. The luciferase gene was liberated from pKS-luc by digestion with 

25 Xho I and Not I and inserted into Xho VNot I digested pKSSIN-l-BV. This plasmid is 
known as SINrep/SIN-1 nsPl-4/luc. 

Additionally, the gene encoding the secreted form of alkaline 
phosphatase (SEAP) was inserted into pKSSIN-l-BV. Briefly, the SEAP gene was 
liberated from the plasmid pCMV/SEAP (Tropix, Bedford, MA) by digestion with Hind 

30 III and Xba I, and inserted in pSK+ (Stratagene, La Jolla, CA) at the corresponding 
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recognition sites contained in the multiple cloning sequence, to generate pSK-SEAP. 
The SEAP gene was then liberated from pSK-SEAP by digestion with Xho 1 and Not I, 
and inserted into the corresponding enzyme recognition sites of pKSSIN-l-BV. This 
plasmid is known as SINrep/SIN-1 nsPl-4/SEAP 
5 The individual SIN-1 nsP genes were substituted into the corresponding 

wild-type virus region of the Sindbis virus-based lac Z replicon described previously 
(Bredenbeek et al., 1 Virol 67:6439-6446, 1993), in order to compare the expression 
properties of the SIN-1 and wild-type expression vectors. Substitution of the SIN-1 nsP 
genes into the Totol 1 01 -derived lac Z replicon was accomplished as described in 
10 Example 1. These vectors were designated as shown in the table below. 

nsP Genes Origin 
Replicon Designation Toto 1101 SIN- 1 

SINrep/lacZ nsP 1-4 

SINrep/SIN-1 nsP2/lacZ nsP 1J-4 nsP 2 

SINrep/SIN-1 nsP3/lacZ nsPl-2,4 nsP 3 

SINrep/SIN- 1 nsP 1 -4/lacZ i .4 

SP6 transcripts were prepared from the replicons shown in the table 
above, after linearization with Xho I, as described in Example 1. RNA transcripts 

15 contained a 5* sequence that is capable of initiating transcription of Sindbis virus, 
Sindbis virus nonstructural protein genes 1 -4, RNA sequences required for packaging, a 
Sindbis virus junction region, the lacZ gene, and the Sindbis virus 3' end proximal 
sequences required for synthesis of the minus strand RNA. 

The in vitro transcription reaction products or purified RNA were 

20 electroporated into baby hamster kidney-21 (BHK-21) cells as described previously 
(Liljestrom and Garoff, Bio/Technology 9:1356-1361, 1991). Alternatively, BHK-21 
cells were complexed with a commercial cationic lipid compound as described in 
Example 1 and applied to BHK-21 cells maintained at 75% confluency. Transfected 
cells were propagated in 35 mm dishes, and incubated at 37°C. 
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The efficiency of transfection of BHK-21 cells with SINrep/lac Z RNAs 
after 9 hours was determined by two alternative methods. In the first method, 
transfected cells expressing p-galactosidase were determined by direct staining with X- 
gal (5-bromo-4-chloro-3-indolyl-b-D-galactopyranoside), after first fixing cells with 2% 
5 cold methanol, as described previously (MacGregor et ah, Cell Mol Genet 75:253-265, 
1987). In the second method, expression of p-galactosidase were determined by 
immunofluorescence, using a rabbit anti- p-galactosidase antibody. A portion of the 
cells transfected by either method described above were propagated on circular glass 
coverslips, contained in 35 mm dishes. At 9 hours post transfection (hpt), the media 

10 was removed by aspiration, and the cells were rinsed twice with PBS, and fixed with 
methanol by incubation overnight at -20°C The methanol was removed by aspiration, 
the cells were rinsed three times with PBS, then incubated with 2% BSA (fraction V, 
Sigma, St. Louis, MO) for 30 minutes at room temperature. Following incubation with 
BSA to prevent non-specific antibody binding, the cells were incubated with the 

15 primary anti-p-galactosidase antibody (diluted 1:800 in 0.1% BSA/PBS) for 1 hour at 
room temperature. Excess primary antibody was then removed by aspiration and 
rinsing three times with 0.1% BSA in PBS. Following 1:100 dilution in 2% BSA in 
PBS, 100 ml of goat anti-rabbit-FITC conjugate secondary antibody (Sigma, St. Louis, 
MO) were added to the coverslips and incubated for 45 minutes at room temperature, in 

20 the dark. Excess secondary antibody was removed by rinsing the coverslips twice with 
0.1% BSA in PBS, and once with PBS. The coverslips were then mounted cell side 
down on a drop of Cytoseal 60 mounting media (Stephens Scientific, Riverdale, NJ), 
placed on a microscope slide. Fluorescence microscopy was used to determine the 
frequency of cells expressing P-galactosidase, in order to determine the transfection 

25 efficiency. 

The level of p-galactosidase in whole transfected cell lysates was 
determined at 9 hpt, by two alternative methods. In the first method, transfected cells 
were rinsed with PBS after aspiration of the media, and 250 \xl of reporter lysis buffer 
(Promega, Madison, WI) per 10 6 cells was added to each dish, P-galactosidase 
30 expression levels were determined by mixing the supernatant fraction from cell lysates, 
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processed by micro centrifugation at .14.000 r.p.m. for 1 minute at room temperature, 
with a commercially available substrate detection system (Lumi-gal, Clontech, Palo 
Alto, CA), followed by luminometry (Analytical Luminescence Laboratory, San Diego, 
C A). In the second method, the activity of p-galactosidase was determined as described 
5 previously by Sambrook and Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory 
Press, NY). Briefly, transfected cells were rinsed with PBS after aspiration of the 
media, and 200 nl of TTE lysis buffer (10 mM Tris-HCI pH 8.0/1 mM EDT A/0.2% 
Triton X-100) per 10 6 cells was added to each dish. Following pelleting of cell debris 
from the cell lysate by micro centrifugation at 14,000 r.p.m. for 1 minute at room 

10 temperature, 50 |xl of the supernatant was added to 0.5 ml of Z buffer pH 7.0 (60 mM 
Na 2 HP(Y40 mM NaH 2 P0,/ 10 mM KC1/10 mM MgS04/50 mM 2-mercaptoethanol) 
and incubated at 37°C for 5 minutes. P-galactosidase activity in samples was then 
determined by spectrophotometry (420 nm) after addition of 0.2 ml of a solution 
containing 4 mg/ml of the chromogenic substrate o-nitrophenyl-b-D-galactopyranoside 

15 (ONPG; SIGMA, St. Louis, MO), in Z buffer. The samples were incubated at 37°C 
until the yellow color developed (approximately 5 minutes), and the reactions were 
terminated by addition of 0.5 ml of 0.5 M Na,CO v 



determined according to the methods described above, and is illustrated in the table 
20 shown below. The data indicated are normalized for varying transfection efficiency, as 
described above. 



The level of P-galactosidase expression in transfected BHK-21 cells was 



Replicon Transfected 



Expression of P-gal, 
relative to SINrep/lacZ 



Standard Deviation 



SINrep/lacZ 
SINrep/SIN-1 nsP2/lacZ 
SINrep/SIN-1 nsP3/IacZ 
SINrep/SIN-1 nsPl-4/lacZ 



0.2 



3.2 



5.2 



1.0 



+ 0.0 



+ 0.4 



+ 1.3 
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The results demonstrate that the replicon vectors derived from the SIN-1 
variant strain are indeed functional. When transfected into BHK-21 cells, the level of 
expressed reporter protein are higher than that observed in wild type virus-derived 
vector transfected cells. Furthermore, as with the phenotype for establishment of 
5 productive persistent infection, the higher level of p-galactosidase expression in BHK- 
21 cells transfected with the SIN-1 -derived replicon vectors mapped primarily to the 
nsP 2 gene. 

EXAMPLE 4 

10 Preparation of SIN 1 -Based DNA Vectors 

A. Construction of Plasmid DNA SIN-1 Derived Expression Vectors 

Efficient initiation of the Sindbis virus infectious cycle can occur in vivo 
from a genomic cDNA clone contained within an RNA polymerase II expression 

15 cassette (Dubensky et al„ J. Virol 70:508-519, 1996.). The ability to express functional 
alphavirus genes from a DNA format has enabled two new alphavirus-based gene 
expression systems to be developed: (1) a plasmid DNA-based vector with applications 
for genetic immunization, and (2) the production of packaged alphavirus particles in 
cells co-transfected with vector replicon and DH plasmid DNAs. Previously, molecular 

20 approaches to produce infectious Sindbis virus RNA and its derived complementary 
vectors were restricted primarily to in vitro transcription of cDNA clones from a 
bacteriophage RNA polymerase promoter followed by transfection into permissive 
cells. 

The plasmid DNA-based alphavirus derived expression vector is known 
25 as ELVS™ (Eukaryotic Layered Vector System). The ELVS™ plasmid DNA vector 
involves the conversion of a self-replicating vector RNA (replicon) into a layered DNA- 
based expression system. Within certain embodiments the first layer has a eukaryotic 
{e.g. RNA polymerase II) expression cassette that initiates transcription of a second 
layer, which corresponds to the RNA vector replicon. Following transport of the 
30 replicon expressed from the first layer from the nucleus to the cytoplasm, autocatalytic 
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amplification of the vector proceeds according to the viral {e.g. alphavinis) replication 
cycle, resulting in expression of the heterologous gene. 

Construction of SIN-1 virus-derived plasmid DNA expression vectors 
were performed with modifications of methods previously described (Dubensky et al M 
5 J. Virol 70:508-5 19, 1996). The expression vector was assembled on the plasmid vector 
pBGS131 (ATCC No. 37443), which is a knamycin resistant analogue of pUC 9 (Spratt 
et al., Gene 47:337-342, 1986). pBGS131 and its derived plasmids were propagated in 
LB medium containing 20 |ig/ml kanamycin. ^ 

To facilitate insertion of heterologous sequences into the expression 
1 0 vector, the Xho 1 recognition sequence, located within the translation^ reading frame of 
the kanamycin gene in pBGS131, was removed by inserting a partially complementary 
12-mer oligonucleotide pair that contained Xho 1 sticky ends. The Xho I recognition 
site was lost as a result of the insertion, as shown below: 

1 5 Oligonucleotide 1 : (SEQ. ID NO. 33) 
S'-TCGATCCTAGGA 

pBGS 1 3 1 sequence after 

Original pBGS 1 3 1 sequence Paired Oligonucleotides insertion of 1 2-mer 

SerArg SerlleLeuGlySerArg 

CTCGAGGC TCGATCCTAGGA CTCGATCCTAGGATCGAGGC 

GAGCTCCG AGGATCCTAGT GAGCTAGGATCCTAGCTCCG 

^ (Xho I site: CTCGAG) (SEQ. ID NOS. 33-36 and 104). 

20 

The oligonucleotide is gel annealed in equal molar concentrations in the 
presence of 10 raM MgCl 2 , heated to 100°C for 5 min, cooled slowly to room 
temperature, and phosphorylated with polynucleotide kinase. The oligonucleotide was 
ligated at a 200:1 ratio of insert:plasmid vector to pBGS131, which was prepared by 
25 Xho I digestion and CIAP treatment. The resulting plasmid is called pBGSl 3 1 dlXho I. 
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The growth rates of XL 1 -Blue (Stratagene) transformed with pBGSI31 or pBGS131 
dVCho I plasmids in LB medium containing 20 jig/ml kanamycin was indistinguishable 
over a time course between 1 .5 and 8 hours. 

The bovine growth hormone (BGH) transcription 
5 termination/polyadenylation signal was inserted between the Sac I and Eco RI sites of 
pBGSBl dlXho I. The BGH transcription termination sequences were isolated by PCR 
amplification using the primer pair shown below and the pCDNA3 plasmid (Invitrogen, 
San Diego, CA) as template. 

10 Forward primer BGHTTF (buffer sequence/Sac 1 site/pCDNA3 nts 1 132-1 16H : 
(SEQ. ID NO. 37) 

S'-TATATATGAGCTCTAATAAAATGAGGAAATTGCATCGCATTGTC 

Reverse primer BGHTTR (buffer seouence/£co RI site/pCDNA3 nts 1180-1 154) : 
15 (SEQ. ID NO. 38) 

5'-TATATGAATTCATAGAATGACACCTACTCAGACAATGCGATGC 

The primers shown above were used in a PCR reaction with a three 
temperature cycling program, using a 30 sec extension period. The 58 bp amplified 

20 product was purified with the PCR purification kit (Qiagen Chatsworth, CA), digested 
with Sac I and Eco RI, purified with GENECLEAN II, and ligated into Sac \IEco RI 
digested, CIAP treated pBGS131. The plasmid is known as pBGS131 AVCho I-BGHTT. 

The 3' end of the Sindbis virus-derived plasmid DNA expression vector 
was then inserted into the pBGS131 dlXho I-BGHTT construct. This region of the 

25 vector contains the following ordered elements: Sindbis virus 3 1 end non-translated 
region (3' NTR); a 40-mer poly(A) sequence; the hepatitis delta virus (HDV) 
antigenomic ribozyme; and a Sac I recognition sequence. Construction of these ordered 
elements was accomplished by nested PCR, using the primers shown below and the 
pKSSIN-l-BV plasmid (Example 4) as template. 

30 
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Forw a rd primer: SIN1 1386F (buffer sequence/M>/ 1 sife/SIN nts 1 1 186-1 tdmy 
(SEQ.IDNO.31) 

5'-TATATATATATGCGGCCGCCGCTACGCCCCAATGATCCGAC 

5 Nested primer: dAHDVIF rno lWAVHDV RR7 nt« \^*y ( S £Q. ID NO. 39) 
S'-AAAAAAAAAA GGGTCGGCAT GGCATCTCCA CCTCCTCGCG 
GTCCGACCTG GGCATC 

Reverse primer: Sar.HDV77R fbuff er seouence/.Sfrc ] site/HDV RBZ nts 77-77) - 
10 (SEQ. ID NO. 40) 

5'-TATATGAGCTCCTCCCTTAGCCATCCGAGTGGACGTGCGTCCTCCTT 
CGGATGCCCAGGTCGGACCGCG 



The primers shown above were used in a PCR amplification according to 
1 5 the reaction conditions and three temperature cycling program described in Example 4, 
with an extension time of 30 sec. The 422 bp amplified product was purified with'a 
PCR purification kit (Qiagen Chatsworth, CA), digested with Not I and Sac I, purified 
with GENECLEAN II, and ligated into Not VSac I digested, ClAP-treated pKSSIN-1- 
BV. This construct is known as pKSSIN-lBV/HDVRBZ and contains Sindbis virus- 
20 derived plasmid DNA expression vector sequences from the Bgl II site at Sindbis nt 
2289 extending through the 3* end of the vector including the HDV ribozyme sequence. 

Plasmid pKSSIN-lBV/HDVRBZ was then digested with Bgl II and 
Sac I, the 5815 bp fragment was isolated by 1% agarose/TBE gel electrophoresis. 
- purified with GENECLEAN II, and was inserted into Bgl WSac I digested, C1AP- 
25 treated pBGS131 dlXho I-BGHTT to generate the plasmid construct known as 
pBG/SIN-lBglLF. This construct contains the region of the Sindbis virus expression 
vector from plasmid pKSSIN-lBV/HDVRBZ described above with the 3' end fused to 
the BGH transcription termination sequence on the pBGS131 dWto I plasmid. 

Assembly of the Sindbis virus plasmid DNA vector was completed by 
30 insertion of the CMV promoter juxtaposed with the first 2289 nts of the Sindbis virus 
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genome (includes the 5' viral end and a portion of the nsPs genes) into the pBG/SIN- 
lBglLF plasmid. Using an overlapping PCR approach, the CMV promoter was 
positioned at the 5' viral end such that transcription initiation results in the addition of a 
single non-viral nucleotide at the 5' end of the Sindbis virus vector replicon RNA. The 
5 CMV promoter was amplified in a first PCR reaction from pCDNA3 (Invitrogen, San 
Diego, CA) using the following primer pair: 

Forward primer: pC/fe/233F (buffer sequence/Bel II recognition sequence/CMV 
promoter nts 1-22^ : (SEQ. ID NO. 41) 
10 5'-TATATATAGATCTTTGACATTGATTATTGACTAG 

Reverse primer: SNCMV1 142R (SIN nts 8*1 /CMV pro nts 1 142-1 108> : 
(SEQ. ID NO. 42) 

S'-CCGTCAATACGGTTCACTAAACGAGCTCTGCTTATATAGACC 

15 

The primers shown above were used in a PCR reaction according to *the 
reaction conditions and three temperature cycling program described in Example 4, with 
an extension time of 1 min. 

The SIN-1 5' end was amplified in a second PCR reaction from 
20 pKSRSIN-lg clone (Example 1 ) using the following primer pair: 

Forward primer: CMVSIN1F (CMV pro nts 1 124-1 142/SIN nts 1-20V 
(SEQ. ID NO. 43) 

25 S'-GCTCGTTTAGTGAACCGTATTGACGGCGTAGTACACAC 
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Reverse pri mer: SIN 3 1 82R (SIN nts 3 1 82-3 1 6f)) * (SEQ ID NO. 44) 

5'-CTGGCAACCGGTAAGTACGATAC 

5 The primers shown above were used in a PCR reaction with a three 

temperature cycling program using a 3 min extension period. 

The 930 bp and 3200 bp amplified products were purified with a PCR 
purification kit (Qiagen) and used together in a PCR reaction with the following primer 
pair: 

10 

Forward primer: pCflff/233F : (SEQ. ID NO. 4 1 ) 

S^TATATATAGATCTTTGACATTGATTATTGACTAG 
15 Reverse primer: (SIN nts 2300-2278) : (SEQ. ID NO. 45) 

» 

5'-GGTAACAAGATCTCGTGCCGTG 

The primers shown above were used in a PCR reaction with a three 
20 temperature cycling program using a 3.5 min extension period. 

The 26 3' terminal bases of the first PCR amplified product overlap with 
the 26 5' terminal bases of the second PCR amplified product; the resultant 3200 bp 
overlapping secondary PCR amplified product was purified by 1% agarose/TBE 
* electrophoresis, digested with Bgl II, and ligated into Bgl II digested, CIAP-treated 
25 pBG/SIN-lBglLF. This construct is called pBG/SIN-1 ELVS 1.5. 

As discussed within Example 1 , relatively few nucleotide point changes 
in the nsP gene sequence of wild-type Sindbis virus result in the phenotype 
characteristic of SIN- 1. No new restriction enzyme recognition sites are generated as a 
result of these nucleotide changes which facilitate clones derived from wild-type and 
30 SIN-1 genotypes to be easily distinguished. A PCR-based diagnostic assay was 
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therefore devised as a rapid method for identification of SIN- 1 derived clones. Briefly, 
forward primers were designed so that a particular base change between SIN-1 and 
wild-type was positioned at the 3' terminal base of the primer. One primer contained 
the SIN-1 nucleotide while another contained the wild-type nucleotide. A reverse 
primer in a region downstream conserved between both genotypes was used in 
combination with each forward primer. At the correct annealing temperature, SIN-I 
templates were only amplified in reactions containing SIN-1 forward primers. The 
primer sequences used to distinguish wild-type and SIN-1 genotypes is given below. 
The reaction conditions were as described throughout the examples contained herein. 

Primer Set I : 

Forward primers: 

WT 1 OOF 5'-GTC CGT TTG TCG TGC A AC TGC 

(SEQ. ID NO. 105) 

SIN- 1 1 OOF: 5'-GTC CGT TTG TCG TGC A AC TGA 
(SEQ. ID NO. 106) 



Reverse primer : 
20 SIN2300R 

PCR Program: (95°C-30 r \ 72°C-2') 20 cycles 

Primer Set 2 : 
25 Forward primers: 

WT3524F 5'-CAA TCT TCC TCA CGC CTT AGC 

(SEQ. ID NO. 107) 



15 



30 



SIN-13524F 5'-CAA TCT TCC TCA CGC CTT AGT 
(SEQ. ID NO. 108) 
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Reverse primer: 
SIN5448R 

5 PCR Program: (95°C-30", 60°C-30", 72°C-2') 20 cycles 

Primer Set 3 : v 
Forward primers: 

WT7592F 5TCC TAA ATA GTC AGC ATA GTA 

10 (SEQ.IDNO. 109) 



SIN- 1 7592F 5'TCC TAA ATA GTC AGC ATA GTT 
(SEQ.IDNO. 110) 

1 5 Reverse primer: 

SIN7643R 5'-TATATCTCGAGGGTGGTGTTGTAGTATTAGTCAG ' 

(SEQ. ID NO. Ill) 

PCR Program: (95°C-30 M , 60 o C-30", 72°C-2') 20 cycles 

20 

Reporter protein expression vectors were constructed by inserting the 
lacZ, SEAP, or luciferase reporter genes into the pBG/SIN-1 ELVS 1.5 vector 
backbone. In separate reactions, the pKS-0-gaI, pSK-SEAP, and pKS-luc plasmids 
- (Example 4), were digested with Xho I and Not I. The fragments containing the lacZ, 
25 SEAP, or luciferase genes were isolated by 1% agarose/TBE gel electrophoresis and 
purified subsequently with GENECLEAN II. These reporter genes were then ligated in 
separate reactions with XhoVNot I digested, CIAP-treated pBG/SIN-1 ELVS 1.5 
plasmid. These constructs are known as pBG/SIN-1 ELVS 1.5-p-gal, pBG/SIN-1 
ELVS 1.5-SEAP, and pBG/SIN-1 ELVS 1.5-Iuc. 

30 
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B - Expression of Heterologous Proteins in Cells Transfected with pBG/£TN-i 

ELVS 1.5-SEAP. pBG/SIN-1 ELVS 1.5-luc or dBG/SIN-1 ELVS l.S-p- p a l 
Expression Vectors 

The pattern of secreted alkaline phosphatase, luciferase, and (J- 

5 galactosidase reporter gene expression in BHK cells transfected with pBG/SIN-1 ELVS 

1.5 or pBG/wt ELVS 1.5 vectors was compared. The pBG/wt ELVS 1.5 plasmid 

contains sequences derived from wild-type Sindbis virus, rather than the SIN-1 variant. 

Construction of the pBG/wt ELVS 1.5 expression vectors was exactly as described 

herein for the pBG/SIN-1 ELVS 1.5 expression vectors, except that full-length genomic 

10 cDNA derived from wild-type Sindbis virus (Dubensky et al., WO 95/07994) was used 
as the template for the vector construction. Construction of the pBG/wt ELVS 1.5 
expression vector has been described previously (Dubensky, supra.): thus, although the 
strains, and therefore the sequences, are different, the Sindbis virus-specific regions 
contained in the pBG/wt ELVS 1.5 and pBG/SIN-1 ELVS 1.5 expression vectors are 

1 5 the same. 

Baby hamster kidney-21 (BHK-21) cells maintained at 75% confluency 
in 12 mm dishes were transfected with 1.0 ^ig of pBG/SIN-1 ELVS 1.5 or pBG/wt 
ELVS 1.5 expression vector plasmid DNAs complexed with 4.0 ^1 of a commercially 
available lipid (Lipofectamine, GIBCO-BRL). Otherwise, transfection conditions were 

20 as suggested by the lipid manufacturer. Eagle minimal essential medium supplemented 
with 5% fetal bovine sera was added to the cells at 4 hours post transfection (hpt), 
unless otherwise indicated. Transfected cells were incubated at 37°C. At various times 
post transfection, as indicated below, several assays were performed to compare vector- 
specific RNA synthesis, and expression of secreted alkaline phosphatase, luciferase, or 

25 p-galactosidase reporter gene expression irv cells transfected with pBG/SIN-1 ELVS 1 .5 
or pBG/wt ELVS 1 .5 plasmid DNAs. 

The levels of alkaline phosphatase secreted into the culture medium of 
BHK cells transfected with pBG/SIN-1 ELVS-1 1 .5-SEAP or pBG/wt ELVS 1 5-SEAP 
plasmid DNA were compared. Cell culture medium was assayed for the presence of 

30 alkaline phosphatase with the Phospha-Light™ chemiluminescent reporter gene assay, 
according to the directions of the manufacturer (Tropix, Inc., Bedford, MA). Briefly, 
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I0\i\ of cell culture supernatant was mixed with 30|al of Dilution Buffer and incubated 
for 30 minutes at 65°C. The sample was allowed to cool to room temperature before 
mixing with 40jil of Assay Buffer. The sample was incubated for five minutes at room 
temperature followed by the addition of 40^1 of Reaction Buffer. Samples were 
incubated for 20 minutes at room temperature. Total luminescence was measured on an 
ML3000 microtiter plate luminometer (Dynatech, Inc., Chantilly, VA) in cycle mode, 
and alkaline phosphatase (AP) in the culture medium of BHK cells transfected with 
pBG/SIN-1 ELVS-1 1.5-SEAP or pBG/wt ELVS 1.5-SEAP plasmid DNAs, were 
determined at 48 hpt, and the results are shown in the table below. 

Plasmid Transfected RLU at 48 hpt 
pBG/SIN- 1 ELVS 1 .5-SEAP 1 8 ± 1 .7 

pBG/wt ELVS 1 .5-SEAP 94 ±10.7 

pCDNA3 0.13 + 0.04 



Additionally, the levels of vector-specific RMAs synthesized in BHK-21 
cells transfected with pBG/SIN-1 ELVS 1.5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmids were determined by Northern blot analysis, exactly as described previously 

15 (Dubensky, supra.), at 48 hours post-transfection. The results of this experiment are 
shown in Figure 9A. Total cellular RNA was isolated from transfected BHK cells with 
Tri-Reagent as described by the manufacturer (Molecular Research Center, Inc., 
Cincinnati, OH). Total cellular RNA concentrations present in samples from 
transfected BHK cells were determined spectrophotometrically. Additionally, material 

20 * isolated from transfected cells was determined to be intact by electrophoresis of 0.5 ug 
of total cellular RNA through 0.7% agarose/TBE mini gels, stained 10 ul/ml of 
ethidium bromide. Northern blot analysis was performed according to Sambrook and 
Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). In 
order that RNA from all transfected samples could be visualized on a autoradiogram 

25 from a single Northern blot analysis, 2.5 ug and 30 ug of RNA were loaded per lane 
from pBG/wt ELVS 1.5-SEAP and pBG/SIN-l ELVS 1.5-SEAP transfected cells, 
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respectively. Four samples of RNA, from individual transfections with both plasmids 
tested, were electrophoresed through 0.7% formaldehyde agarose gels and transferred to 
Zeta-probe membrane (Bio-Rad, Richmond, CA). The blot was hybridized with 
random-primed probes corresponding to the alkaline phosphatase gene. The results of 
5 this experiment in which the levels of vector-specific RNA synthesis and AP expression 
in transfected BHK cells were compared at 48 hpt, demonstrate that while the level of 
vector-specific RNA synthesized in cells transfected with pBG/SIN-1 ELVS 1.5-SEAP 
DNA was at least 100-fold lower than in pBG/wt ELVS 1.5-SEAP transfected cells, the 
levels of AP were only 5-fold lower in cells transfected with pBG/SIN-1 ELVS 1.5- 
1 0 SEAP DNA, compared to pBG/wt ELVS 1 .5-SEAP DNA. 

The levels of alkaline phosphatase secreted into the culture medium of 
BHK cells transfected with pBG/SIN-1 ELVS-1 1.5-SEAP or pBG/wt ELVS 1.5-SEAP 
plasmid DNA were also compared over a 7 day time-course. The results of this study, 
illustrated in Figure 9B, demonstrate that the levels of AP present in the culture medium 
15 at early time points were much lower in cells transfected with pBG/SIN-1 ELVS-1 1.5- 
SEAP plasmid, compared to pBG/wt ELVS 1.5-SEAP plasmid. However, the level of 
AP expressed in cells transfected with the SIN-1 Sindbis virus variant strain-derived 
vectors rapidly increased and was higher than in cells transfected with wild-type virus- 
derived vectors by the 96 hpt time point. 
20 The luciferase levels present in BHK cells transfected with pBG/SIN-1 

ELVS-1 1.5-luc or pBG/wt ELVS 1.5-Iuc plasmid DNA were compared at 24, 48, and 
72 hpt. The luciferase expression levels were quantitated by adding 250 nl of reporter 
lysis buffer (Promega, Madison WI) per 10 6 transfected cells, centrifuging the lysate at 
* 14,000 rpm for 1 min, and then mixing the supernatant fraction from the cell lysates 
25 with a commercially available substrate detection system (Promega, Madison WI), 
followed by luminometry (Analytical Luminescence Laboratory, San Diego, CA). The 
results from this experiment (shown in the table below), and shown graphically in 
Figure 10, parallel the results observed with the alkaline phosphatase expression 
vectors. At early times post transfection the luciferase expression levels were lower in 
30 BHK cells transfected with pBG/SIN-1 ELVS 1.5-luc plasmid, compared to pBG/wt 
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ELVS 1.5-luc plasmid. However, at the .48 and 72 hpt time points, the luciferase levels 
were similar in BHK cells transfected with Sindbis virus SIN-1 variant strain- and wild- 
type-derived expression vectors. 



Plasmid Transfected 



Hr. Post 
Trans- 
fection 



Relative Light Units 
(Ave, + SD) 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wt ELVS 1 .5-luc 



24 



1.2 x 10 9 
2.1 x 10 s 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wtELVS 1.5-!uc 



48 



3.3 x 10' 
4.1 x 10' 



pBG/SIN-1 ELVS 1.5-luc 
pBG/wt ELVS 1 .5-luc 



72 



1.8 xlO 9 
5.8 x 10 9 



pCDNA3 



48 



482 



The efficiency of transfection of the pBG/SIN-1 ELVS 1.5-p-gal and 
pBG/wt ELVS 1.5-p-gal plasmids in BHK cells at 48 hpt was determined by direct X- 
gal (5-bromo-4-chloro-3-indolyl-b-D-galactopyranosidc) staining of the cell monolayer 
(MacGregor, Cell Mol. Genet. 75:253-265, 1987), in order to measure directly the 
10 number of cells expressing P-galactosidase. The transfection efficiencies were 
equivalent, and are shown in the table below. 



Plasmid Transfected 
pBG/SIN-1 ELVS 1.5-p-gal 
pBG/wtELVS 1.5-p-gal 



No. Blue Cells/1 00X Field 
24 + 3 
26 + 7 



15 



The levels of vector-specific RNAs synthesized in BHK-21 cells 
transfected with pBG/SIN-1 ELVS 1.5-p-gal or pBG/wt ELVS 1.5-p-gal plasmids 
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were determined by Northern blot analysis, exactly as described previously (Dubensky, 
supra.), at 48 and 72 hours post-transfection. Total cellular RNA was isolated from 
transfected BHK cells with Tri-Reagent as described by the manufacturer (Molecular 
Research Center, Inc., Cincinnati, OH). Total cellular RNA concentrations present in 
5 samples from BHK ceils transfected with pBG/SIN-1 ELVS 1.5-p-gal or pBG/wt 
ELVS 1.5-P-gal plasmids were determined spectrophotometrically. Additionally, 
material isolated from transfected cells was determined to be intact by electrophoresis 
of 0.5 ug of total cellular RNA through 0.7% agarose/TBE mini gels, stained 10 ul/ml 
of ethidium bromide. Northern blot analysis was performed according to Sambrook and 

10 Maniatis (1989, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). 
In order that RNA from all transfected samples could be visualized on a autoradiogram 
from a single Northern blot analysis, 2.5 ug and 5 ug of RNA were loaded per lane 
from pBG/wt ELVS 1.5-P-gal and pBG/SIN-1 ELVS 1.5-P-gal transfected cells, 
respectively. Two samples of RNA, from individual transfections with both plasmids 

15 tested, at 48 and 72 hpt, were electrophoresed through 0.7% formaldehyde agarose gels 
and transferred to Zeta-probe membrane (Bio-Rad, Richmond, CA). The blot was 
hybridized with random-primed probes corresponding to the P-galactosidase gene. The 
results of this experiment, shown in Figure 11 A, demonstrate that the level of vector 
specific RNA synthesized in cells transfected with pBG/SIN-1 ELVS 1.5-p-gal DNA at 

20 48 and 72 hours post transfection was at least 100-fold lower than the level of RNA 
detected in pBG/wt ELVS 1 .5-P-gal transfected cells. 

Additionally the p-galactosidase expression levels were quantitated in 
transfected whole cell lysates by adding 250 ^1 of reporter lysis buffer (Promega, 

Madison WI) per 10 6 transfected cells, centrifuging the lysate at 14,000 rpm for 1 min, 
25 and then mixing the supernatant fraction from the cell lysates with a commercially 
available substrate detection system (Clontech, Palo Alto, CA), followed by 
luminometry (Analytical Luminescence Laboratory, San Diego, CA). The results from 
this experiment (shown in the table below, and graphically in Figure 9C) demonstrate 
that at early times post transfection the p-galactosidase expression levels were 
30 significantly lower in BHK cells transfected with pBG/SIN-1 ELVS 1 .5-p-gal plasmid, 
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compared to pBG/wt ELVS 1.5-P-gal plasmid. However, reporter expression rapidly 
increased over the time-course in pBG/SIN-1 ELVS 1.5-p-gal transfected cells such 
that the P-galactosidase levels were higher than in wild-type virus transfected cells at 
the final 120 hr time point. 

Plasmid Transfected Hr. Post Relative Light Units 

Trans- (Ave, + SD) 

fection 

pBG/SIN-1 ELVS 1 :5-B-gal 24~^ 54030 + 7348 

pBG/wt ELVS 1 .5-p-gal 4801 590 ± 74,425 



pBG/SIN-1 ELVS 1.5-p-gal 
pBG/wtELVS 1.5-p-gal 

pBG/SIN-1 ELVS 1.5-p-gal 
pBG/wtELVS 1.5-p-gal 



48 



72 



310830 + 31083 
2214921 +248071 

1443474 + 98156 
3793524 + 857336 



pBG/SIN-1 ELVS 1.5-p-gal 
pBG/wtELVS 1.5-p-gal 



96 



2232585 + 299166 
3514262 + 548225 



pBG/SIN-1 ELVS 1.5-p-gal 
pBG/wtELVS 1.5-p-gal 



120 



3200910+ 128036 
1986537+ 166869 



pCDNA3 



3637 



Expression of P-galactosidase in cells transfected with the pBG/SIN-1 
ELVS-1 1.5-p-gal or pBG/wt ELVS 1.5-B-gal plasmid DNAs was also measured 
directly by Western blot analysis using a monoclonal antibody specific for the reporter 
10 protein (Boehringer Mannheim), at the final 120 hpt time point In parallel with the 
reporter protein activity determined at 120 hpt, the level of P-galactosidase protein was 
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at the same level, or greater, in PBG/SIN-1 ELVS-1 1.5-p-gal transfected BHK cells, 
compared to pBG/wt ELVS 1.5-p-gal, and is demonstrated in Figure 1 1C. 

Taken together, the results described herein demonstrate that the level of 
vector-specific RNA synthesized is at least 100-fold less in pBG/SIN-1 ELVS 
5 transfected cells, compared to pBG/wt ELVS transfected cells. Importantly however, 
after a 48-72 hour lag, the levels of reporter gene expression are equivalent, or higher, 
in pBG/SIN-1 ELVS transfected cells, compared to pBG/wt ELVS transfected cells. 
The phenotype of pBG/SrN-1 ELVS, characterized by high expression levels combined 
with low vector-specific RNA synthesis in transfected cells, is due likely to the 

10 diminished, or absent, inhibition of host cell protein synthesis. This property of 
pBG/SIN-1 ELVS thus results in much higher levels of expressed reporter protein per 
subgenomic mRNA translation template in transfected cells, compared to pBG/wt 
ELVS. In summary, the phenotype of the plasmid DNA expression vectors derived 
from the SIN- 1 variant strains follows the parent virus, in terms of equivalent 

15 expression levels, combined with relatively low levels of RNA synthesis, compared to 
wild-type virus derived-vectors. As vectors do not contain any of the Sindbis virus 
structural proteins, this phenotype must map to the nonstructural genes of the SIN-1 
virus variant. 



20 

EXAMPLE 5 

Modifications of plasmid DNA SIN-1 derived expression vectors 

Expression levels of heterologous genes in target cells from alphavirus- 
25 based vectors are effected by several factors, including host genus and vector 
configuration. For example, P-galactosidase expression levels are 10- to 100-fold 
higher in BHK cells, compared to some human cells, such as HT1080, transfected with 
pBG/ELVS vectors. The levels of reporter gene expression in BHK and several human 
cell lines transfected with pBG/wt ELVS 1.5-pgal plasmid DNA (see example 4) were 
30 compared in order to establish the relative level of vector-specific expression in cell 
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types derived from the intended in vivo target genus. The levels of p-galactosidase 
expression in BHK cells and HT1080 (ATCC CCL 121) cells, a human fibrosarcoma 
line, transfected with pBG/wt ELVS 1.5-pgal plasmid, or with a conventional plasmid 
expression vector, were determined. The conventional plasmid vector was constructed 
5 by insertion of the lac Z gene (Promega, Madison, Wl) into the CMV promoter-driven 
pUC-derived expression plasmid multiple cloning site (Invitrogen, San Diego, CA), and 
is known as pCMV-p-gal. The results of this study, given in Figure 12 A, demonstrate 
that the level of P-galactosidase expression was nearly 100-fold lower in HT1080 cells 
transfected with pBG/wt ELVS 1.5-pgal plasmid DNA, compared to BHK cells. Cells 

10 were also transfected with pCMV-p-gal in order to segregate RNA polymerase II 
expression from Sindbis virus vector replicon expression. In this experiment, while the 
expression decreased 5- to 1 0-fold in HT1080 cells transfected with pCMV-p-gal 
plasmid, compared to BHK cells, expression decreased nearly 100-fold in HT 1080 
cells transfected with pBG/wt ELVS 1 5-pgal plasmid DNA. Thus, the results indicate 

15 that the dramatic decrease of reporter gene expression in HT1 080 cells transfected with 
pBG/wt ELVS 1.5-Pgal plasmid DNA is due in part to the diminished activity of the 
Sindbis virus vector replicon in these human cells. 

Given the overall plasticity of the RNA alphaviral genome and the 
propagation of virus in BHK cells, it is not surprising that the expression levels of 

20 heterologous genes are highest in the host cell lines from which the vectors were 
derived. Thus, selection of alphaviruses with the SIN-1 phenotype (as described in 
Examples 1 and 2), characterized by comparatively low viral RNA levels and equivalent 
virus production levels, combined with delayed or absent inhibition of host cell protein 
synthesis, can be performed in any human primary, or diploid or polyploid human cells. 

2 ^ In addition to selecting alphaviruses with desired phenotypes in cells 

(e.g., human) which more closely parallel target cells in vivo, several alternative 
modifications of the prototype plasmid DNA expression cassette components can also 
be performed. For example, substitution of the MoMLV RNA polymerase II promoter 
with the stronger CMV immediate early (IE) promoter significantly enhances the level 

30 of heterologous gene expression in transfected cells (Dubensky et al., 1 Virol 70:508- 



WO 97/38087 



PCT/US97/06010 



139 

519, 1996, and Dubensky et al., W/0 95/07994). Further, juxtaposition of introns, for 
example SV40 small t antigen or CMV intron A, either upstream or downstream from 
the heterologous gene, can increase the level of heterologous gene expression in some 
transfected cell types (Dubensky et al., supra). 
5 Several further alternative modifications of the prototype plasmid DNA 

expression cassette components can also be utilized in order to enhance the overall 
expression in transfected cells in vitro or in vivo. In one modification, the Hepatitis B 
virus (HBV) posttranscriptional regulatory element (PRE) was inserted in the pBG/wt 
ELVS 1.5-Pgal plasmid DNA. The PRE sequence activates the transport of HBV S 

10 transcripts in cis from the nucleus to the cytoplasm. The PRE sequence appears to 
function independently of splice donor and acceptor sites, and has been shown to 
activate cytoplasmic expression of a (}-globin transcript not containing introns. It has 
been proposed that the PRE functions in cis to allow the export of nuclear transcripts 
that do not interact efficiently with the splicing pathway and hence are not exported 

15 well from the nucleus (Huang et al., Molecular and Cellular Biology 75:3864-3869, 
1995). 

The PRE sequence was cloned into pBG/SIN-1 ELVS 1.5-pgaI by 
isolating first a PCR-generated 564 bp fragment of HBV from the full length genomic 
clone of the ADW viral strain, pAM6 (ATCC No. 39630). The amplified fragment 
20 extends from base 1238-1802 of the HBV genome. The primer sequences are given 
below. 



25 



Forward Primer: NTPRE1238F (SEQ. ID NO. 46) 

5 ! -CCTATGCGGCCGCGTGGAACCTTTGTGGCTCCTC 



Reverse Primer: EAPRE1802R (SEQ. ID NO. 47) 
S'-CCTATTGGCCAGCAGACCAATTTATGCCTAC 



30 
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The primers introduce a Not I recognition site at the 5' end of the 
fragment and an Eae I recognition site at the 3' end. Not 1 and Eae 1 have compatible 
sticky ends. The Eae I recognition site is internal to the Not I site, so Eae I cuts both 
sites. 

5 The PCR fragment was digested with Eae I and cloned into Not 1 

digested/CIAP treated pBG/wt ELVS 1.5-pgal. The correct clone retains the Not I site 
at the 5" terminus of the PRE and is called pBG/wt ELVS/PRE 1 .5-pgal. 

The possible effect of the PRE sequence contained in ELVS plasmids on 
heterologous gene expression in transfected cells was determined. Briefly, BHK and 

10 HT1080 cells were cultured in 12 mm dishes to 75% confluency and transfected with 
500 ng of pCMV-P-gal, pBG/wt ELVS 1.5-Pgal, or pBG/wt ELVS/PRE 1. 5-pgal 
plasmid DNA complexed with Lipofectamine (GIBCO-BRL, Gaithersburg, MD), and 
the level of p-galalactosidase expression was determined 48 hr later. Transfection 
efficiencies were determined by direct Xgal staining of transfected monolayers, as 

1 5 described in Example 4, and are shown in the table below. 

No. Blue cells/12 mm Dish 



Construct BHK HT1080 

pCMV-B-gal 549 42 

pBG/wt ELVS 1 .5-Pgal 1 46 1 

pBG/wt ELVS/PRE 1 .5-pgal 334 33 

Mock 0 0 



The results demonstrate clearly that the number of BHK or HT1080 cells transfected 
with ELVS plasmids expressing P-galalactosidase was increased dramatically by 
- inclusion of the RNA transport PRE sequence in the vector. Further, these results 
20 indicate that one cause for the diminished heterologous gene expression levels in 
HT1080 cells, compared to BHK cells, transfected with ELVS plasmid DNA is the 
inefficient transport of the primary transcript from the nucleus. 

In parallel with the higher frequency of reporter protein expressing 
HT1080 cells transfected with ELVS plasmids containing the PRE sequence, the levels 
25 of P-galalactosidase were dramatically higher in lysates from HT1080 cells transfected 
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with ELVS vectors containing the PRE sequence. These results are illustrated in 
Figure 12B, and taken together with the results shown in the table above, demonstrate 
that functional vector replicons are transported inefficiently from the nucleus in human 
cells transfected with ELVS plasmids. Further, inclusion of the PRE sequence in the 
5 ELVS plasmid construct increases the level of heterologous gene expression in all cells 
tested, demonstrating a clear relationship between efficiency of cytoplasmic vector 
replicon transport and overall heterologous gene expression level. 

Several other viral sequence elements which operate in cis to transport 
unspliced RNAs have also been identified. For example, a 219 bp sequence, located 

10 between nts 8022 and 8240 near the 3 T end of the Mason-Pfizer monkey virus (MPMV) 
genome, has been shown to enable Rev independent human immunodeficiency virus 
type 1 (HIV-1) replication (Bray et al., PNAS 97:1256-1260, 1994). The MPMV RNA 
transport element, known as the constitutive transport element (CTE), is inserted into 
the pBG/SIN-1 ELVS 1.5-0gal plasmid by first isolating a PCR-generated 219 bp 

15 fragment of MPSV from the full length genomic clone template (Sonigo et al., Cell 
45:375-385, 1986), or the MPSV subgenomic clone pGEM7FZ(-)MPSV 8007-8240 (D. 
Rekosh, Ham-Rek Laboratories, SUNY at Buffalo, 304 Foster Hall, Buffalo, New 
York). The amplified fragment extends from base 8022-8240 of the MPSV genome. 
The primer sequences are given below. 

20 

Forward Primer: NMPVM8021F (SEQ. ID NO. 48) 

5 r -CCTATGCGGCCGCTAGACTGGACAGCCAATGACG 



25 Reverse Primer: EMPMV8241R (SEQ. ID NO. 49) 

5-CCTATTGGCCAGCCAAGACATCATCCGGGCAG 



30 



The primers introduce a Not I recognition site at the 5' end of the 
fragment and an Eae I recognition site at the 3' end. Not I and Eae I have compatible 
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sticky ends. The Eae I recognition site is internal to the Not I site, so Eae I cuts both 
sites. 

The PCR fragment is digested with Eae I and cloned into Not I 
digested/CIAP treated pBG/wt ELVS 1.5-Pgal. The correct clone retains the Not I site 
5 at the 5* terminus of the PRE and is called pBG/wt ELVS/CTE 1 .5-{Jgal. 

In addition to the HBV PRE and MPSV CTE sequences, several RNA 
transport elements from other viral or cellular sources can be inserted into the ELVS 
plasmid constructs, as described above. For example, some of these elements include 
the HIV Rev responsive element (Malim et al., Nature. 3J5:254-257, 1989), the HTLV 
10 1 Rex element (Ahmed et. al, Genes Dev., 4:1014-1022, 1990), and another exacting 
sequence from simian retrovirus type 1 (Zolotukhin et al., J. Virol, 65:7944-7952, 
1994). In addition, each of the above RNA transport elements also may be incorporated 
into the structural protein expression casettes, packaging cell lines, or producer cell 
lines described in Examples 6 and 7. 

1 5 I" yet another modification of prototype ELVS vectors, expression of the 

alphavirus replicons can be driven from an RNA polymerase I promoter. Briefly, 
because RNA polymerase I promoters are not tissue specific and are expressed in 
essentially all human cells in the body, they provide an attractive alternative for plasmid 
DNA-directed alphavirus replicon expression in transfected cells. For example, the 

20 human rDNA promoter (plasmid prHU3, Learned and Tjian, J. Mol AppL Gen.. 7:575- 
584, 1982), has been used to construct a vector for heterologous gene expression 
(Palmer et al., Nuc. Acids Res. 27:3451-3457, 1993). 

Thus, wihtin one embodiment of the invention a RNA polymerase I 
- promoter can be juxtaposed with the 5' end of the replicon cDNA such that the first 

25 nucleotide transcribed in transfected cells corresponds to the authentic alphavirus 5* end. 
Identification of the RNA polymerase I promoter (e.g., plasmid prHU3) nucleotide at 
which transcription initiation occurs is determined as described previously (Dubensky et 
al., W/O 95/07994). 

All modifications described herein can be performed with ELVS 
30 constructs containing the Sindbis virus wild-type or SIN- 1 nsPs, or nsP genes from any 
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alphavirus. For example, all of the constructions provided in Example 5 can also be 
performed with plasmid pBG/SIN-1 ELVS 1.5-Pgal, whose construction is described in 
Example 4. 

5 EXAMPLE 6 

Construction of Alphavirus Packaging Cell Lines 

In the present invention, alphavirus packaging cell lines (PCL) are 
provided, whereby the virus-derived structural proteins necessary for RNA packaging 

1 0 and formation of recombinant alphavirus vector particles are encoded by one or more 
stably transformed structural protein expression cassette(s). Synthesis of these proteins 
preferably occurs in an inducible manner, and in particularly preferred embodiments, 
via transcription of subgenomic mRNA from their native "junction region" promoter. 
Inducible subgenomic transcription is mediated by the input alphavirus vector RNA 

15 itself (Figure 13). Following primary transcription from the structural protein 
expression cassette(s), cytoplasmic amplification of the RNA transcript is initiated *by 
vector-encoded nonstructural proteins, and ultimately leads to transcription from the 
junction region promoter and high level structural protein expression. The structural 
protein expression casettes may include any of the previously described elements of the 

20 present invention, including RNA transport elements (e.g., HBV PRE and MPMV CTE) 
and splicing sequences. Such PCL and their stably transformed structural protein 
expression cassettes can be derived using methods described within PCT application 
WO 95/07994, or using novel approaches described within this invention. PCL may be 
- derived from almost any existing parental cell type, including both mammalian and 

25 non-mammalian cells. Preferred embodiments for the derivation of PCL are cell lines 
of human origin. 

A. Construction of Vector-Inducible Alphavirus PCL 

For example, an alphavirus structural protein expression cassette was 
30 constructed, whereby primary transcription from a CMV immediate early promoter 
produces an RNA molecule capable of efficient cytoplasmic amplification and 
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structural protein expression only after translation of nonstructural replicase proteins 
from the vector RNA. Specifically, plasmid pDCMV-dlnsPSIN (Dubensky el al., 1 
Virol 70:508-519, 1996), a DNA-based Sindbis defective helper (DH) vector, was 
modified to contain both a hepatitis delta virus (HDV) antigenomic ribozyme sequence 
5 (Perotta and Been, Nature 550:434-436, 1991) for 3'-end RNA processing, and an SV40 
small t antigen intron inserted within the region of nonstructural protein gene deletion. 
Due to restriction site duplications associated with insertion of the HDV ribozyme 
sequence, an additional plasmid from Dubensky et al, (ibid), pDLTRSINgHDV, was 
used as starting material to reconstruct the modified CMV-based DH construct. 

10 Plasmid pDLTRSINgHDV, an LTR-based Sindbis genomic clone containing the HDV 
ribozyme, was digested with Bgl II to remove the existing LTR promoter and Sindbis 
nucleotides 1-2289 (numbering according to Strauss et al.. Virology 133:92-1 10, 1984), 
treated with calf intestinal alkaline phosphatase, and purified from a 0.7% agarose gel 
using GENECLEAN II™ (Biol 01, San Diego, CA). The corresponding 5'-end 

1 5 fragment with a CMV promoter was obtained by Bgl II digestion of the Sindbis 
genomic clone pDCMVSINg (Dubensky et al., ibid) and purification from a 1% agarose 
gel using GENECLEAN II, and then ligated into the Bgl //-deleted pDLTRSINgHDV 
vector to generate the construct pDCMVSINgHDV. This CMV-based genomic plasmid 
with an HDV ribozyme was shown to produce infectious Sindbis virus and cytopathic 

20 effect within 24 hr after transfection into BHK cells. Defective helper plasmid 
pDCMVdlnsPSINgHDV, containing the HDV ribozyme, was then constructed by BspE 
I digestion and relegation under dilute conditions, to remove nonstructural gene 
sequences between nucleotides 422 and 7054. Subsequently, the SV40 intron was 
* synthesized by PCR and inserted into the region of nonstructural protein gene deletion. 

25 Amplification of the SV40 intron sequence was accomplished by standard three-cycle 
PCR with a 30 second extension time, using plasmid pBR322/SV40 (strain 776, ATCC 
#45019) as template and the following oligonucleotide primers that were designed to 
contain flanking BspE I or Bam HI sites. 
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Forward primer: BspSVSDF (Surest. site/SV40 intron sea.) (SEQ. ID. NO. 50) 
S-TATATATCCGGA/AAGCTCTAAGGTAAATATAAAATTTTT-S' 

Reverse primer: BamSVSAR (5'-rest. site/SV40 intron sea.) (SEQ. ID. NO. 51) 
5 S'-TATATAGGATCC/TAGGTTAGGTTGGAATCTAAAATACACAAAC-S* 

Following amplification, the DNA fragment was purified using a QIAquick-spin PGR 
purification kit (Qiagen, Chatsworth, CA), digested with BspE I and Bam HI, purified 
from a 1.2% agarose gel using Mermaid™ (Biol 01, San Diego, CA), and ligated into 

10 the defective helper plasmid pDCMV-dlnsPSIN, which was also digested with BspE I 
and Bam HI, treated with calf intestinal alkaline phosphatase, and purified from a 0.7% 
agarose gel using GENECLEAN II, to generate the construct pDCMV-intSINrbz 
(Figure 14). Plasmid pDCMV-intSINrbz, which also contains an SV40 promoter- 
driven neomycin resistance selectable marker on another portion of the plasmid, was 

1 5 transfected into BHK cells using Lipofectamine™ (Gibco/BRL, Gaithersburg, MD), as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells were 
trypsinized and re-plated in media containing 600 ug/ml G418 (neomycin). The media 
was exchanged periodically with fresh G41 8-containing media and foci of resistant cells 
were allowed to grow. Cells were trypsinized and cloned by limiting dilution in 96 well 

20 tissue culture dishes, and individual cell clones were grown and expanded for screening. 

Positive packaging activity for the individual clones was identified by 
Lipofectin™ (Gibco/BRL, Gaithersburg, MD)-transfection with Sindbis vector RNA 
that expresses a luciferase reporter gene (described in Dubensky et al., ibid), harvesting 
the culture supernatants at approximately 24 hr post-transfection, and assaying for the 

25 presence of packaged Sindbis-luciferase vector particles. In addition, initial transfection 
levels were determined by harvesting the transfected cell lysates using reporter lysis 
buffer (Promega, Madison, WI), and testing for the presence of luciferase activity by 
using luciferin substrate (Promega), as described by the manufacturer. To assay for 
packaged vector particles in the culture supernatants, 1 ml of undiluted, clarified 

30 supernatant was used to infect fresh BHK cell monolayers for approximately 1 8 hr. The 
cells were subsequently lysed as above, and luciferase activity was determined. The 
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presence of luciferase activity in the infected BHK cells was confirmation of packaged 
vector particles in the transfected cell supematants. Several positive cell clones 
harboring integrated copies of the pDCMV-intSINrbz structural protein expression 
cassette, and functioning as PCL, were identified. Packaging data for two of the 
5 individual PCL clones (# 1 0s- 1 9 and U 1 Os-22), that were representative of the group, are 
shown in Figure 15. In addition, the titer of packaged vector particles being produced 
was determined by transfecting an individual PCL clone (#1 Os-22) with SIN-p-gal 
vector RNA (described in Dubensky et at, ibid). The culture supernatant was recovered 
at 48 hr post-transfection, clarified by passage through a 0.45 mm filter, and fresh BHK 

10 monolayers were infected with 10-fold dilutions of the supernatant. Approximately 14 
hr post-infection, the cells were washed with PBS. fixed with 2% formaldehyde, 
washed again with PBS, and stained with X-gal. Vector particle units were then 
determined by counting individual blue-stained cells. Packaged (J-gal vector titers from 
this PCL clone were approximately 10 6 infectious units/ml of supernatant. Vector- 

15 controlled inducibility of Sindbis structural protein expression was demonstrated by 
western blot analysis using a polyclonal rabbit antiserum specific for the structural 
proteins. Positive 1 Os-22 PCL and negative control BHK cell lysates were made in 
Lameli sample buffer, either before (U; uninduced) or after (I; induced) transfection 
with SIN-p-gal vector RNA. As shown in Figure 16, the only cell lysate that showed 

20 expression of Sindbis structural proteins was the 1 Os-22 PCL clone after transfection 
with vector RNA. Differences in the apparent levels of expression between the capsid 
protein and envelope glycoproteins do not reflect the actual amounts of protein being 
made, rather, the lower stability of the envelope glycoproteins during the cell lysis 
* procedure used for this particular experiment. 

25 Packaging activity of the CMV-based DH construct was also highly 

efficient in non-mammalian cells, for example, C6/36 mosquito cells. The use of such a 
non-mammalian parental cell type for derivation of PCL may be particularly 
advantageous when the PCL are intended for subsequent use as starting material for the 
generation of vector producer cell lines. The advantage of this cell type is the natural 

30 ability of alphaviruses to establish a persistent infection, without the mammalian cell- 
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associated phenotype of inhibition of host macromolecular synthesis and resulting 
cytopathic effect (CPE). Thus a DNA-based alphavirus vector (Examples 4 and 5), with 
an appropriate selectable marker, may be stably transformed into mosquito or other 
non-mammalian cell-derived PCL. Figure 17A shows that both a DNA-based luciferase 
5 reporter vector and DH helper vector expressing Sindbis structural proteins, under the 
control of the CMV promoter, were fully functional in C6/36 cells, as demonstrated by 
luciferase vector packaging. 

B. Construction of PCL with operablv-linked selection marker 

10 In other embodiments of the present invention, a selectable marker is 

operably linked to transcription of the alphavirus structural protein expression cassette. 
In preferred embodiments, this operable linkage is accomplished either by insertion of 
the marker into the region of nonstructural protein gene deletion, as a fusion with 
remaining nsPl amino acids, or by insertion downstream of the structural protein genes, 

15 under the translational control of an internal ribosomal entry site (IRES) sequence. 
Again, amplification of the primary structural protein gene mRNA transcript »and 
induction of structural protein expression is controlled by the input vector RNA 
molecule and its synthesized nonstructural proteins. 

Specifically, for construction of the structural protein expression 

20 cassette, plasmid pBGSI31 (Spratt et al, Gene 47:337-342, 1986; ATCC #37443) was 
modified to remove extraneous sequences, and to render an existing Xho I site within 
the kanamycin resistance gene non-functional. Plasmid pBGS131 was digested with 
Xho I and a synthetic double-stranded oligonucleotide linker with Xho I-compatible 
ends was ligated into the site. The synthetic 12-mer oligonucleotide, shown below, was 

25 designed as a partial palindrome that would anneal to itself generating Xho I sticky ends 
for ligation, and maintaining the kanamycin resistance gene open reading frame by 
inserting four in-frame amino acids. 



dUftolinker (SEQ. ID. NO. 52) 
30 5'TCGATCCTAGGA 
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Insertion of this oligonucleotide resulted in a Xho I site-deleted plasmid, designated 
pBGS131dLW»o/. The plasmid was next digested with BspH I and religated to itself 
under dilute conditions to remove 829 bp of extraneous sequence between the ColEl 
5 replicon and kanamycin resistance marker, generating the plasmid pBGSBldlB. The 
BspH I site next was changed to a Pac 1 site by digesting pBGS131dlB with BspH I, 
making the termini blunt with Klenow enzyme and dNTPs, and ligating with excess 
Pac I linker. 

10 Pac I linker (SEQ. ID. NO. 53) 
5'-GCTCTTAATTAAGAGC 

This new construct, designated pBGS131dlB-P, was further modified by digesting with 
Fsp I and Pvu II to remove an additional 472 bp, including the multiple cloning site 
1 5 (MCS) and purifying the remaining vector from a 1% agarose gel using GENECLEAN 
II. A replacement MCS was inserted into the modified vector by annealing two 
complimentary oligonucleotides, PME.MCS1 and PME.MCSII, and ligating with the 
linear plasmid. 

20 PME.MCSI (SEQ. ID. NO. 54) 

5'-CTGTTTAAACAGATCTrATCTCGAGTATGCGGCCGCTATGAATTCGTrTAAACGA-3' 

PME.MCSII (SEQ. ID. NO. 55) 

S'-TCGTTTAA ACGAATTCATAGCGGCCGCATACTCG AG ATA AG ATCTGTTTAA ACAG-3' 

25 

The new, approximately 2475 bp, cloning vector was designated 
pBGSVG, and contained the following multiple cloning site: <Pme I - Bgl II - Xho I - 
Not I - EcoR I - Pme I>. Insertion of the structural protein expression cassette 
containing an operably linked selectable marker proceeded stepwise, as follows. A 
30 DNA fragment comprising the 3'-end of Sindbis virus, a synthetic A40 tract, the 
antigenomic HDV ribozyme, and a BGH transcription termination signal, was removed 
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from plasmid pBG/SIN-1 ELVS 1.5 (Example 5) by digestion with Not I and EcoR /, 
: and purification from a 1% agarose gel using GENECLEAN II. Plasmid pBGSVG also 

was digested with Not I and EcoR /, purified from a 1% agarose gel using 
GENECLEAN II, and ligated with the purified 3'-end/A40/HDV/BGH fragment, to 
5 generate the construct pBGSV3\ Next, an approximately 9250 bp Sindbis cDNA 
fragment, containing the structural protein genes and much of the nonstructural protein- 
encoding region, was removed from plasmid pDLTRSINg (Dubensky et al, ibid) by 
digestion with Bgl II and Fsp I, and purified from a 0.7% agarose gel using 
GENECLEAN II. The Sindbis cDNA fragment was then ligated into plasmid 

10 pBGSV3\ which was also digested with Bgl II and Fsp I, treated with alkaline 
phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. The new 
construct was designated pBGSV3 r BF. Subsequently, this construct was digested with 
Bgl //, treated with alkaline phosphatase, and purified with GENECLEAN II for 
insertion of remaining 5'-end and nonstructural gene sequences, along with a CMV IE 

15 promoter. The remaining sequences were obtained by digestion of 
plasmidpDCMVSINg (Dubensky et al., ibid) with Bgl II, purification of the fragment 
from a 1% agarose gel using GENECLEAN II, and ligation with the linear pBGSV3 , BF 
vector, to create the CMV-driven Sindbis genomic construct, pBGSVCMVgen. 
Functionality of this construct for initiation of the Sindbis virus replication cycle was 

20 determined by Lipofectamine-mediated transfection of pBGSVCMVgen plasmid into 
BHK cells, and the observance of CPE within 24 hr post-transfection. 

Plasmid pBGSVCMVgen was subsequently used to construct a DH 
structural protein expression cassette by deleting most of the nonstructural protein gene 
sequences and inserting a neomycin resistance gene as an in-frame fusion with 

25 remaining codons of the nsPl open reading frame. Briefly, the neomycin resistance 
gene was amplified by standard three-cycle PCR from the pcDNA3 vector (Invitrogen, 
San Diego, CA), using the following oligonucleotide primers that were designed to 
contain flanking BspE I and BamH I sites. 

* 

1 

1 
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Forward primer: NEQ5TU SE forest, site/neo eene^ (SEQ. ID. NO. 56) 
5-AfATATCCGGA/GTCCGGCCGCTTGGGTGGAGAGGCTA 

Reverse pri mer: NEQ3'BAM r5'-rest. site/neo gene^ (SEQ. ID. NO. 57) 
5 5-ATATAGGATCCTrCAGAAGAACTCGTCAAGAAGGCGA 

Following amplification, the DNA fragment was purified with QIAquick-spin, digested 
with BspE I and BamH I, purified using GENECLEAN II, and ligated into plasmid 
pBGSVCMVgen that had also been digested with BspE 1 and BamH I treated with 

10 alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. 
The resulting construct was designated pBGSVCMVdlneo, and is shown schematically 
in Figure 14. The configuration of pBGSVCMVdlneo includes, as part of the structural 
protein expression cassette and controlled by the same CMV promoter, a fusion protein 
comprising the initiator methionine and amino-terminal 121 amino acids of nsPI and 

1 5 the neomycin resistance gene lacking its methionine initiator codon and next ten amino 
acids. 

Plasmid pBGSVCMVdlneo was transfected into BHK cells using 
Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfusion, the cells were trypsinized and re-plated in media containing 600 ug/ml of 

20 the drug G4 18 (neomycin). The media was exchanged periodically with fresh G4 18- 
containing media and foci of resistant cells were allowed to grow. Cells were 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones were grown and expanded for screening. Positive packaging 
. activity for the individual clones was identified by transfecting with Sindbis luciferase 

25 vector RNA and assaying for the presence of packaged Sindbis-luciferase vector 
particles as described in the previous section. Several positive cell clones harboring 
integrated copies of the pBGSVCMVdlneo structural protein gene expression cassette, 
and functioning as PCL, were identified. SNBS™-luciferase vector packaging data for 
individual clones (Fl 1, F 13, F 15) that are representative of the group, as well as the 

30 previously described 1 0s-22 PCL line, are shown in Figure 1 8. 
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In addition to demonstrating functional packaging activity with Sindbis- 
lucifease vectors, additional experiments performed using the same PCL also showed 
that vectors derived from other alphavimses also could be packaged. For example, both 
Sindbis (Dubensky etal., ibid.) and Semliki Forest (pSFV3-/acZ; GIBCO BRL, 
5 Gaithersburg, MD) vector RNAs expressing P-galactosidase, were transfected into the 
F15 packaging cell line. Approximately 48 hr post-transfection, the culture 
supernatants were harvested, clarified, diluted serially, and used to infect fresh BHK 
cell monolayers for determination of vector particle titers. At 1 8 hr post-infection, the 
BHK cells were fixed* stained with X-gal, and the blue-staining cells were counted, as 

10 described previously. The vector titers obtained for Sindbis P-gal were approximately 
5xl0 6 IU/ml, while the titers for SFV p-gal were approximately 4xl0 6 IU/ml. These 
data demonstrate that the two different alphavimses and their corresponding vectors 
have similar packaging signals and that PCL derived for the Sindbis systems described 
herein are fully functional when used with another alphavirus. 

15 Packaging activity of the pBGSVCMVdlneo construct also was highly 

efficient in cells of human origin, for example, 293 cells. The use of such a human 
parental cell type for derivation of PCL may be particularly advantageous in the 
generation of complement resistant recombinant alphavirus particles. Figure 17B 
shows that both RNA and DNA-based luciferase reporter vectors were efficiently 

20 packaged following transfection into G418 resistant, pBGSVCMVdlneo-transformed 
293 PCL, as demonstrated by supernatant transfer of luciferase expression into BHK 
cells. 

Another selectable drug-resistance marker also was shown to function in 
a similar PCL configuration, as a fusion protein with remaining nsPl amino acids at its 
25 N-terminus. Briefly, the hygromycin phosphotranferase gene (hygromycin resistance 
marker, hygro*) was substituted into plasmid pBGSVCMVdlneo, in place of the existing 
neomycin resistance marker. The hygro r gene was amplified by standard three-cycle 
PCR from plasmid p3'SS (Stratagene, La Jolla, CA), using the following 
oligonucleotide primers that were designed to contain flanking EcoRV and BamHl sites. 

30 
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Forward pri mer: 5'HYGROEV (S'-rest. site/hvgro pene) (SEQ. ID. NO. 112) 
S'-TATATGATATC/AAAAAGCCTGAACTCACCGCGACG 

Reverse pri mer: 3'HYGROBA (S'-rest site/hvgro p ene) (SEQ. ID. NO. 1 13) 
5 5-ATATAGGATCC/TCAGTTAGCCTCCCCCATCTCCCG 

Following amplification, the DNA fragment was purified with QlAquick-spin, digested 
with EcoRV and BamHl, purified using GENECLEAN, and ligated into plasmid 
pBGSVCMVdlneo that had been digested with BspEl, blunt-ended with Klenow, 

10 digested further with BamM, treated with alkaline phosphatase, and purified from a 
0.7% gel using Geneclean. The resulting construct was designated pBGSVCM Vdlhyg. 

Plasmid pBGSVCMVdlhyg was transfected into BHK cells using 
Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfusion, the cells were trypsinized and re-plated in media containing 1 .2 mg/ml of 

1 5 the drug hygromycin (Boehringer Mannheim). The media was exchanged periodically 
with fresh hygromycin-containing media and foci of resistant cells were allowed to 
grow into a pool. Functionality of the selected packaging cells was demonstrated by 
transfecting with Sindbis luciferase vector RNA and assaying for the presence of 
packaged Sindbis-luciferase vector particles as described in the previous section. 

20 Positive results from these packaging experiments are shown in Figure 39. 

In an alternative packaging cell line structural protein expression 
cassette, the selectable marker (in this case neomycin resistance) was inserted 
downstream of the Sindbis structural protein genes and under the translational control of 
- an internal ribosome entry site (IRES). Thus, transcription of the mRNA encoding 

25 neomycin resistance occurs both at the genomic level (from the RSV promoter) and also 
from the subgenomic junction region promoter. Additional features unique to this 
construct include the Rous sarcoma virus (RSV) LTR promoter for primary 
transcription and a tRNA Asp 5'-end sequence derived from Sindbis defective-interfering 
RNA clone DI25 (Monroe and Schlesinger, Proc. Natl Acad. Sci USA 50:3279-3283, 

30 1983). This particular PCL expression cassette configuration was designated 
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987DHBBNeo, and is shown schematically in Figure 14. Specifically, plasmid 
987DHBBNeo is constructed stepwise using the modified plasmid vector pBGS!31dlB- 
P (described above) as starting material. A cDNA fragment containing the junction 
region promoter, the structural protein gene sequences, and 3-untranslated region + 
5 polyA is obtained by digestion of the full-length Sindbis cDNA clone pRSINg 
(Dubensky et al., ibid) with BamH I and Xba I, and purification from a 0.7% agarose gel 
using GENECLEAN II. The Sindbis cDNA DNA fragment is ligated with plasmid 
vector pBGS131dlB-P that also has been digested with BamH I and Xba A treated with 
alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II, to 

1 0 generate the construct pBGSINsp. 

Next, the transcription termination signal from the SV40 early region is 
inserted between the Sac I and Eco RI sites of pBGSINsp, immediately downstream of 
the Sindbis sequence. The SV40 viral nucleotides 2643 to 2563, containing the early 
region transcription termination sequences, are isolated by PCR amplification using the 

1 5 primer pair shown below and the pBR322/S V40 plasmid ( ATCC # 450 1 9), as template. 

Forward primer: FSVTT2643 (5'-rest. site/SV40 nts 2643-2613) (SEQ. ID. NO. 58) 
5'-TATATATGAGCTCTTACAAATAAAGCAATAGCATCACAAATTTC 

20 Reverse primer: RSVTT2563 (rest. site/SV40 nts 2563-2588) (SEQ. ID. NO. 59) 
5'-TATATGAATTCGTTTGGACAAACCACAACTAGAATG 

The primers are used in a standard three-cycle PCR reaction with a 30 
- second extension period. The amplification products are purified with QIAquick-spin, 
25 digested with Sac I and Eco RI, purified again with the Mermaid kit, and the 90 bp 
fragment is ligated into plasmid pBGSINsp that also has been digested with Sac I and 
EcoRl, treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. This construction is known as pBGSINspSV. 

Next the RSV promoter and Sindbis 5'-end sequences, including the DI 
30 tRNA^ structure, are assembled by overlapping PCR and the entire fragment is 
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inserted into the structural protein gene vector pBGSINspSV. In PCR reaction #1, the 
RSV promoter fragment is amplified by standard three cycle PCR, with a 1 minute 
extension, using the following oligonucleotide primers that are designed to also contain 
a flanking Bgl II site in one primer and sequences overlapping the tRNA 5'-end in the 
5 other. The Sindbis tRNA 5'-end is positioned immediately adjacent to the RSV 
promoter transcription start site. 

Forward pri mer: 5'RSVpro fS'-rest. site/RSV seo.^ (SEQ. ID. NO. 60) 
5'-TATATAGATCT/AGTCTTATGCAATACTCTTGTAGT 

10 

Reverse prim er: 3'RSVtR f5'-Sin tRNA seo/RSV seq/> (SEQ. ID. NO. 61) 
5'-GGGATACTCACCACTATATCTCGACGGTATCGAGGTAGGGCACT 

In PCR reaction #2, the Sindbis 5'-end plus tRNA sequence is amplified by standard 
15 three cycle PCR with a 1 minute extension, from template plasmid 
Totoll01(5'tRNA Asp ) (Bredenbeek et al., J. Virol. 67:6439-6446, 1993), using the 
following oligonucleotide primers that are designed to also contain a flanking BamH I 
site in one primer and sequences overlapping the 3'RSVtR primer in the other. 

20 Forward pri mer: 5'tRNASin (5'-Sindhis + tRNA seq. only) (SEQ. ID. NO. 62) 
5'-GATATAGTGGTGAGTATCCCCG 

Reverse pri mer: 3'SinBam (3'-rest. site/Sindbis seq.^ (SEQ. ID. NO. 63) 
* 5'-TATATGGATCC/CGG AG ATCCTTAATCTTCTC ATG 

25 

Following amplification, the DNA fragments are purified with QlAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 2 minute extension, 
using additional 5'RSVpro and 3'SinBam primers. The resulting overlapping PCR 
amplicon is purified using GENECLEAN II, digested with Bgl II and BamH I, and 
30 ligated into plasmid pBGSINspSV that also has been digested with Bgl II and BamH I, 
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treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. The resulting structural protein expressing, defective helper is 
designated 987DHBB. 

Next, the IRES sequence from encephalomycodarditis virus (EMCV), is 
5 positioned immediately upstream of the neomycin phosphotransferase gene, as a 
selectable marker, by overlapping PCR, and the entire amplicon is inserted into the Nsi 
I site of 987DHBB. Insertion at the Nsi I site will position the selectable marker 
immediately downstream of the structural protein ORE. In PCR reaction #1, the EMCV 
IRES fragment (nucleotides 260-827) is amplified by standard three cycle PCR, with a 
10 30 second extension, from template plasmid pBS-ECAT (Jang et al., J. Virol 65:1651, 
1989), using the following oligonucleotide primers that are designed to also contain a 
flanking Nsi I site in one primer and sequences overlapping the neo gene in the other. 

Forward primer: 5'EMC VIRES fS'-rest. site/EMCV sea.) (SEQ. ID. NO. 64) 
1 5 5'-TATATATGCAT/CCCCCCCCCCCCCAACG 

Reverse primer: 3'EMCVIRES (5'-pcDNA + neo seo/EMCV sea.) (SEQ. ID. NO. 65) 
S'-CATGCGAAACGATCCTCATC/CTTACAATCGTGGTTTTCAAAGG 

20 In PCR reaction #2, the neo resistance marker is amplified by standard three cycle PCR 
with a 1.5 minute extension, from template plasmid pcDNA3 (Invitrogen, San Diego, 
CA), using the following oligonucleotide primers that are designed to also contain a 
flanking Nsi I site in one primer and sequences overlapping the 3'EMCVIRES primer in 
. the other. 

25 

Forward primer: SWeo/pcDNA (5-pcDNA + neo seq. only) (SEQ. ID. NO. 66) 
S'-GATGAGGATCGTTTCGCATGATTGA 



Reverse primer: 3Tsfeo/pcDNA (3-rest site/neo seq.l (SEQ. ID. NO. 67) 
30 5-TATATATGCAT/TCAGAAGAACTCGTCAAGAAGGCGA 
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Following amplification, the DNA fragments are purified with QIAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 2 minute extension, 
using additional 5'EMCVIRES and 3*Neo/pcDNA primers. The resulting overlapping 
5 PCR amplicon is purified using GENECLEAN II, digested with Nsi /, and ligated into 
plasmid 987DHBB that also has been digested with Nsi /, treated with alkaline 
phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. The 
resulting structural protein expression construct, with the IRES/neo insert in the Sindbis 
3"-untranslated region, is designated 987DHBBNeo. 



10 ug of plasmid 987DHBBNeo, using a standard calcium phosphate precipitation 
protocol. Approximately 24 hr post-transfection. the media was replaced with fresh 
media containing I mg/ml of the drug G418. After one additional day, the cells were 
trypsinized and re-plated at 1/10 density in media containing 500 ug/ml G418. After 

1 5 several more passages, the cells were subjected to dilution cloning and individual clones 
were expanded. The ability of individual clones to function as packaging cell lines was 
determined by calcium phosphate transfection of plasmid RSV/Sinrep/LacZ, a Sindbis 
DNA vector expressing (}-gal, and assaying for the presence of packaged vector 
particles in the supernatants after 48 hr. The packaged vector replicons were titered by 

20 the CPE assay described in Frolov and Schlesinger (J. Virol. 68: 1 721-1727, 1994) and 
one that gave high titers of packaged particles, designated 987DH-BBNeo, was used for 
further characterization. Packaged vector titers were determined at 48 hr, following 
transfection of either RNA- or DNA-based Sindbis vectors expressing fj-gal, using 
-several different transfection techniques. The results were as follows: 



10 



To generate stable packaging cell lines, BHK cells were transfected with 



25 



transfection procedure 
electroporation 



RSVSINrep/LacZ DNA (2.5 ug) 

SINrep/LacZ RNA (2.5 ug) 

RSVSINrep/LacZ DNA (2 ug) 
SINrep/LacZ RNA (2 ug) 



nucleic acid added 



titers (infectious units/mil 
1.5 x 10 9 /ml 
6x 10 9 /ml 



electroporation 



Lipofectamine 
Lipofectin 



no packaged particles 
5-6 x 10 7 /ml 
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transfection procedure nucleic acid added titers (infectious units/ml^ 

Calcium Phosphate RSVSINrep/LacZ DN A ( 1 0 ug) i 5 x j o 9 / m i 

In addition, SINrep/LacZ particles that were packaged using 987DH-BB 
cell lines subsequently were used to infect fresh BHK cell monolayers and examine 
both RNA and protein expression patterns. Figure 19 shows the RNA pattern after 
5 BHK cells were infected with two different preparations of SINrep/LacZ particles at a 
MOI of 150 infectious units per cell (lanes 1 and 2), or wild-type Sindbis vims (lane 3), 
as a control. Seven hours post-infection, dactinomycin (1 ug/ml) and [ 3 H]uridine (20 
uCi/ml) were added, followed by harvest and analysis of RNA 4 hr later, according to 
Bredenbeek et al. (J. Virol 67:6439-6446, 1993). The high MOI was used in order to 

10 detect possible recombinants. Horizontal lines to the right of the gel lanes indicate the 
Sindbis and p-gal RNAs of interest. The highest molecular weight band indicates the 
genomic RNA of the replicon or virus (lanes 1 and 2, SINrep/LacZ; lane 3 Sindbis 
virus). The next two RNAs indicated are the genomic RNA of the 987DH-BBNeo PCL 
expression cassette and the inducible subgenomic structural protein mRNA from the 

15 same 987DH-BBNeo PCL cassette. The presence of the latter two bands demonstrates 
that the helper genomic RNA derived from the packaging cell line is also co-packaged. 
The next RNA bands, those present in greatest abundance, are the subgenomic RNAs 
derived from either SINrep/LacZ (lanes 1 and 2) or the Sindbis virus genome (lane 3). 

Protein analysis was performed following infection of BHK 21 cells with 

20 packaged SINrep/LacZ replicons at a MOI of 20 infectious units/cell. Fifteen hours 

post-infection, the cells were labeled with [ 35 S] methionine for 30 minutes, lysates 
made, and the proteins analyzed by SDS-PAGE. As shown in Figure 20 (lanes 2 and 
3), both beta-galactosidase and the Sindbis virus capsid protein are labeled in the vector 
particle-infected cells, but not in uninfected cells (lane 1). The presence of capsid 
25 shows that some of the packaged particles also contain structural protein gene RNA 
transcripts from the PCL. 
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C. Constructio n of "split structural p en e" PCL configurations 

In other embodiments of the present invention, PCL are provided 
wherein the alphavirus structural proteins are expressed, not as a polyprotein from a 
single mRNA, with its native post-translational processing, but rather, as separate 

5 proteins from independent mRNAs that are transcribed via multiple cassettes. Such a 
configuration greatly minimizes the possibility of recombination or co-packaging events 
that lead to formation of replication-competent or infectious virus. In preferred 
embodiments, the capsid protein is expressed from one stably transformed cassette and 
the envelope glycoproteins are expressed together from a second stably transformed 

0 cassette, and each is expressed in a vector-inducible manner from the junction region 
promoter (described above). 

For example, the Sindbis virus capsid protein gene was amplified from 
plasmid pDLTRSINg (Dubensky et al, ibid), by standard three-cycle PCR with a 1.5 
minute extension, using the following oligonucleotide primers that were designed to 

5 contain a flanking Xho I site and capsid protein gene initiation codon or a flanking Not I 
site and translation stop codon. 



Forward primer: SINS'C Xho (S'-rest site/cansid seq.l (SEQ. ID. NO. 68) 
5 r -ATATACTCGAG/ACCACCACCATGAATAGAGGATTC 

Reverse primer: SIN3'C Not f5'-rest. site/stop codon/cansid seq.^ (SEQ. ID. NO. 69) 
S'-TATATGCGGCCGC/TATTA/CCACTCTTCTGTCCCTTCCGGGGT 

Following amplification, the capsid DNA fragment was purified with QIAquick-spin, 
digested with Xho I and Not I, purified using GENECLEAN II, and ligated into the 
DNA-based Sindbis expression vector pDCMVSIN-luc (Dubensky et al., ibid), that also 
had been digested with Xho I and Not I to remove its luciferase reporter gene insert, 
treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 
GENECLEAN II. The resulting capsid protein expression construct was designated 
pDCMVSIN-C. Plasmid pDCMVSIN-C was subsequently digested with BspE I to 
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remove most nonstructural protein gene sequences, and re-ligated to itself under dilute 
conditions to create the DH vector construct, pDCMVSINdl-C. 

Plasmid pDCMVSINdl-C, which also contains a neomycin resistance 
cassette, was transfected into BHK cells using Lipofectamine, as described by the 
5 manufacturer. Approximately 24 hr post-transfection, the cells were trypsinized and re- 
plated in media containing 600 ug/ml of the drug G418 (neomycin). The media was 
exchanged periodically with fresh G418-containing media and foci of resistant cells 
were allowed to grow. Cells were trypsinized and cloned by limiting dilution in 96 well 
tissue culture dishes, and individual cell clones were grown and expanded for screening. 

10 Cells which inducibly expressed capsid protein in response to input vector were 
identified by transfecting with Sindbis luciferase vector RNA, making cell lysates 
approximately 15 hr post-transfection, and performing western blot analysis with a 
rabbit anti-Sindbis polyclonal antibody. Several positive cell clones harboring 
integrated copies of the capsid protein gene expression cassette and inducibly 

15 expressing the protein were identified. 

In order to demonstrate both inducibility and functionality of the 
expressed capsid in the context of "split structural gene" cassettes, an additional 
construct that expressed the Sindbis virus envelope glycoproteins was generated from 
pDCMVSIN-Iuc. Briefly, the Sindbis envelope glycoprotein genes were amplified from 

20 plasmid pDLTRSINg by standard three-cycle PCR, with a 2.5 minute extension, and 
using the following oligonucleotide primers that are designed to contain a flanking Xho 
I site and translation initiation codon in good Kozak context, or a flanking Not I site and 
the translation stop codon. 

25 Forward primer: 5'GLYCO-X (5'-rest. site/initiation codon/glvcoprotein seq.) 
(SEQ. ID. NO. 70) 

5'-ATATACTCGAG/AGCAATG/TCCGCAGCACCACTGGTCACGGCA 

Reverse primer: 3'GLYCO-N (S'-rest. site/glvcoorotein seq.) (SEQ. ID. NO. 71) 
30 5'-ATATAGGCGGCCGC/TCATCTTCGTGTGCTAGTCAGCATC 



WO 97/38087 



160 



PCT/US97/06010 



Following amplification, the glycoprotein gene DNA fragment was purified with 
QIAquick-spin, digested with Xho I and Not I, purified using GENECLEAN II, and 
ligated into the DNA-based Sindbis expression vector pDCMVSIN-Iuc, that also had 
5 been digested with Xho I and Not I to remove its luciferase reporter gene insert, treated 
with alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN 
II. The resulting glycoprotein expression construct was designated pDCMVSINl.SPE. 

Figure 21 is a western blot demonstrating vector controlled inducibility 
of two different clonal capsid lines (9-3 and 9-9) cell lines, that were transfected with 

10 Sindbis DNA vectors expressing the envelope glycoproteins (1.5PE lanes) or a p. 
galactosidase reporter, pDCMVSIN-p-gal (Dubensky et a!., ibid ; 1.5p-gal lanes), or 
"mock" transfected (M lanes), using Lipofectamine. Cell lysates were made at 48 hr 
post-transfection, separated by SDS-PAGE, and transferred to membranes, where they 
were probed with a combination of antibodies specific for Sindbis structural proteins 

15 and p-galactosidase. The blot clearly shows the inducibility of capsid protein in 
response to the nonstructural proteins supplied by either vector, as well as the 
expression of p-galactosidase and the envelope glycoproteins. Functionality of the 
"split structural gene" capsid cell lines, by complementation and vector particle 
packaging, was demonstrated by co-transfecting the p-galactosidase and envelope 

20 glycoprotein vectors into a capsid cell line using Lipofectamine, and assaying for 
packaged particles in the culture supematants. Approximately 48 hr post-transfection, 
the supematants were harvested and clarified for the packaging assays and vector titer 
determination. In addition, the cells were lysed using Lameli sample buffer and 
examined by western blot analysis with polyclonal anti-Sindbis antibody, demonstrating 

25 expression of both capsid protein and the vector supplied envelope glycoproteins. The 
supematants were then tested for the presence of packaged vector particles by infecting 
naive BHK cells for approximately 18 hr, and staining for (J-gal reporter gene 
expression, as described previously in this example. Functionality of the cell lines for 
complementation and packaging was demonstrated by the observance of blue-stained 

30 P-gal expressing cells. 
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To generate stable "split structural gene" PCL that have separate vector 
inducible expression cassettes for both capsid protein and the envelope glycoproteins, 
the above described capsid cell lines are used, in conjunction with an additional 
envelope glycoprotein expression construct that contains a different selectable marker 
5 (for example, hygromycin B resistance). Briefly, plasmid pBG/SIN-1 ELVS 1.5-P-gal 
(Example 5) is digested with BamH I and Fsp I to isolate the sequences comprising the 
junction region promoter, the |J-gal reporter gene, and some 3 -end sequences. The 
desired fragment is purified from a 1% agarose gel using GENECLEAN II, and ligated 
into plasmid pBGSVCMVdlneo (see above) that also has been digested with Bam HI 

10 and Fsp I to eliminate all structural protein gene sequences, treated with alkaline 
phosphatase, and purified from a 0.7% agarose gel using GENECLEAN II. The 
resulting construct is designated as pBGSVCMVdlsP-luc. Plasmid pBGSVCMVdlsP- 
luc is next digested with Xho I and Not 1 to remove the luciferase reporter gene, treated 
with alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN 

15 II, and the Xho 1- and Not I-digested envelope glycoprotein PCR amplicon from above 
is subsequently ligated into the digested pBGSVCMVdlsP-luc vector to generate the 
envelope glycoprotein expressing DH construct, pBGSVCMVdl-G. Insertion of a 
hygromycin resistance marker cassette into this plasmid, as well as flanking HSV TK 
promoter and polyadenylation sequences, is accomplished by PCR amplification, using 

20 a standard three-cycle protocol with 2.5 minute extension, plasmid pDR2 (Clontech, 
Palo Alto, CA) as template, and the following oligonucleotide primers that are designed 
to contain flanking Pac I sites. 

. Forward primer: 5'HYGRO/Pro-P (S'-rest site/pDR2 seq.) (SEQ. ID. NO. 72) 
25 S'-ACACATTAATTAA/CGATGCCGCCGGAAGCGAGAA 



Reverse primer: 3'HYGRO/pA-P (S'-rest. site/pDR2 sea.) (SEQ. ID. NO. 73) 
5-ACACATTAATTAA/GTATTGGCCCCAATGGGGTCT 
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Following amplification, the DNA fragment is purified with QIAquick-spin, digested 
with Pac I, purified using GENECLEAN II, and ligated into plasmid pBGSVCMVdl-G 
that also has been digested with Pac I, treated with alkaline phosphatase, and purified 
using GENECLEAN II. The resulting construct is designated as pBGSVhygro-G. 
5 Plasmid pBGSVhygro-G, which now also contains a hygromycin 

resistance cassette, is transfected into the capsid cell line using Lipofectamine, as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells are 
trypsinized and re-plated in media containing 1200 ug/ml of hygromycin (Calbiochem, 
La Jolla, CA). The media is exchanged periodically with fresh hygromycin-containing 

1 0 media and foci of resistant cells are allowed to grow. Cells are trypsinized and cloned 
by limiting dilution in 96 well tissue culture dishes, and individual cell clones arc 
grown and expanded for screening. Cells which inducibly express biologically active 
capsid protein and envelope glycoproteins in response to input vector are identified in 
two ways: first, by Lipofectin-mediated transfection with Sindbis luciferase vector 

15 RNA and probing 15 hr post-transfection lysates in western blots, as described above; 
and second, by similar Sindbis luciferase vector transfection, but harvesting the 
supematants at 24 hr post-transfection, and testing for the transfer of luciferase 
expression to fresh BHK cells by infection with the supematants (see above). Split 
structural gene PCL derived in this manner are designated C/GLYCO PCL. 



20 



D - Constructio n of PCL with "hybrid" structural proteins 

An additional approach which may be utilized to decrease the level of 
co-packaging or recombination between DH and vector RNA molecules, to enhance 
- translation of the glycoprotein genes, or to alter the cell or tissue specificity of the 

25 packaged recombinant alphavirus vector particles, makes use of structural protein genes 
derived from other alphaviruses or togaviruses. More specifically, numerous 
combinations of alphavirus or togavirus structural protein genes for use with Sindbis 
virus or different alphavirus vectors can be envisioned. For example, the capsid protein 
gene of Ross River virus (RRV), may be used in conjunction with the envelope 

30 glycoprotein genes of Sindbis virus (expressed from the same or a different construct), 
to package a Sindbis virus-derived vector described in examples 3, 4, or 5. In addition, 
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a deleted form of the RRV capsid protein gene may be positioned immediately 
upstream of the Sindbis glycoprotein gene sequences to serve as a translational 
enhancer elements. As another example, the structural proteins of Sindbis virus may be 
used to package Semliki Forest virus RNA vectors. 
5 Specifically, defective helper (DH) structural protein constructs that 

contain an intact or deleted form of the RRV capsid gene (Figure 22), plus the Sindbis 
glycoprotein genes, were constructed by PCR amplification of Sindbis virus or Ross 
River virus sequences from piasmid templates Totol 101 (Rice et al., J. Virol 67:3809- 
3819, 1987) and RR6415 (Kuhn et al., Virology 752:430-441, 1991), respectively, and 

1 0 incorporation of those sequences into the previously described DH constructs DH-BB 
or DH-BB(5*SIN) (Bredenbeek et al., J. Virol. 67:6439-6446, 1993), for transcription 
in vitro with SP6 polymerase. The DH constructs DH-BB or DH-BB(S'SIN) different 
by the presence (DH-BB) or absence (DH-BB(5 , SIN)) of a Dl-derived tRNA Asp 
sequence at the Sindbis 5'-end. The following table indicates the specific PCR primer 

15 sequences and their corresponding Sindbis or Ross River nucleotide positions, with 
capital letters indicating viral nucleotides and lower case letters indicating additiortal 
nucleotides of restriction sites used in cloning steps or specific point mutations. Point 
mutations in primer 6 (RRV Bsp) did not change the resulting amino acid sequence. 

20 Primers used for PCR: 

1 . RRV Ava: (nt.75 10-75251 (SEQ. ID. NO. 74) 
5-ccacgaattcGGTCCTAAATAGATGC 

25 2. RRV Neil: (nt.7606-7620) (SEQ. ID. NO. 75) 
5'-ccacaagcttCCGGGCGAGGCCGCC 

3. RRV Nci2: (nt.76 16-7630) (SEQ. ID. NO. 76) 
5-ccacggatCCCGGCGTTCCGTCC 
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4- RRVApal: (nt.8088-81021 (SEQ. ID. NO. 77) 

5'-ccacaagcttGTGCACTGGGATCTG 

5- RRV Aoa2: fnt.8097-81 1 H (SEQ. ID. NO. 78) 
5 5'-ccacggatccGTGCACATGAAGTCC 

6. RRVBsp: fnt.8339-836H (SEQ. ID. NO. 79) 

5'-ccacaagCTTCcGGaGTTACCCGAGTGACC 

10 7. RRVAfll: (nt.7820-78361 (SEQ. ID. NO. 80) 

5-ccaccttaaGCGTCGGCTTTTTCTTC 

8- RRVAA2: (nt.7892-79071 (SEQ. ID. NO. 8 1 ) 

5'-ccaccttaaGAGAAGAGAAAGAATG 

15 

9- SIN Ava: fnt.759 1-75941 (SEQ. ID. NO. 82) 

5'-ccacaagcttGGACCACCGTAGAG 

10. SIN Bam: (nt.7325-73431 (SEQ. ID. NO. 83) 
20 5'-CCGCGTGGCGGATCCCCTG 

1 1 SINBsp: fnt.841 8-84331 (SEQ. ID. NO. 84) 
5'-ccacggatCCGGAAGGGACAGAAG 

25 12. SINBsu: (nt.8887-89021 (SEQ. ID. NO. 85) 

5'-CACGGTCCTGAGGTGC 



PCR reactions were performed using the primer pairs indicated below, in a standard 
three cycle protocol, with 30 sec extensions and Vent polymerase, to produce the 
30 corresponding DNA fragments, which are also indicated below. 
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PCR fragments:primer pairs, plasmid template 
Fragment 1: prl+pr2, RRV6415 plasmid 
Fragment 2: pr3+pr4, RRV6415 plasmid 
5 Fragment 3: pr5+pr6, RRV6415 plasmid 
Fragment 4: pr3+pr7, RRV64 1 5 plasmid 
Fragment 5: pr4+pr8, RRV6415 plasmid 
Fragment 6: pr9+prl0, Totol 1 01 plasmid 
Fragment 7: prl l+prl2, Totol 101 plasmid 

10 

Following amplification, the PCR products were digested with the indicated enzymes, 
and ligated into the pUC18 plasmid analog, pRS2, which contains additional polylinker 
sites and which had also been digested with the same enzyme combinations: fragment 1 
was cut with EcoR l+Hind III; fragment 2 with BamH \+Hind III; fragment 3 with 

15 BamH l+Hind III; fragment 4 with BamH l+Afl II; fragment 5 with Hind Hl+Afl II; 
fragment 6 with BamH l+Hind III; and fragment 7 with BamH l+Bsu 361 All 
insertions were sequenced to verify that artifacts had not been acquired during PCR. 

Subsequently, the fragments were released from the pRS2 plasmids 
using the enzymes indicated below, and ligated exactly as indicated to generate the next 

20 set of constructs. To generate FR8, fragment 6 (cut by Bam HI and Ava II) was ligated 
with fragment 1 (cut by Ava II and Nci 7), fragment 2 (cut by Net I and Hind III) and 
plasmid pRS2 (cut by Bam HI and Hind III). To generate FR9, fragment 6 (cut by 
BamH I and Ava II) was ligated with fragment 1 (cut by Ava II and Nci /), fragment 4 
. (cut by Nci I and Afl II) and plasmid pRS2 (cut by BamH I and Afl II). To generate 

25 FR10, fragment 5 (cut by Afl II and ApaL I) was ligated with fragment 3 (cut by ApaL I 
and Hind III) and plasmid pRS2 (cut by Afl II and Hind III). After transformation of 
E. colu plasmids were analyzed by restriction analysis and their inserts were again 
isolated by digestion and used for the next steps of cloning. 

The FR8 insert (cut by BamH I and ApaL I) was ligated with fragment 3 

30 (cut by ApaL 1 and BspM II), fragment 7 (cut by BspM II and Bsu36 1) and plasmid DH- 
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BB (cut by BamH I and Bsu36 I). The same fragments also were used to replace the 
BamH 1-Bsu36 I fragment of plasmid DH-BB(5'SIN). The resulting plasmids were 
designated DH-BB Citv and DH-BB(5'SIN) Crrv, respectively (see Figure 23). The 
FR9 insert (cut by BamH I and Afl II) was ligated with the FRIO insert (cut by Afl II and 
5 BspM II), fragment 7 (cut by BspM II and Bsu36 1) and plasmid DH-BB (cut by BamH I 
and Bsu36 I). The same fragments also were used to replace the BamH \-Bsu36 I 
fragment of plasmid DH-BB(5 , SIN). The resulting plasmids were designated DH-BB 
CArrv and DH-BB(5'SIN) CArrv (see Figure 23). 

Multiple uses for DH constructs that contain chimeric structural protein 

10 genes are possible, and two such approaches are illustrated in Figures 24 and 25. In 
Figure 24, the intact Ross River capsid protein gene is linked with the Sindbis 
glycoprotein gene sequences (DH-BB Crrv or DH-BB(5 , SrN) Crrv), as part of a 
defective helper construct, and co-transfected with a Sindbis reporter RNA vector 
replicon to demonstrate packaging into recombinant alphavirus particles (Figure 26). In 

15 Figure 25, the deleted form of the Ross River capsid protein gene is linked with the 
Sindbis glycoprotein gene sequences (DH-BB CArrv and DH-BB(5 , SIN) CArrv), as a 
translations enhancer and part of the DH construct, while the Sindbis capsid protein 
gene expressed from a second DH construct. Both DH constructs are co-transfected 
with a Sindbis reporter RNA vector replicon to demonstrate packaging into recombinant 

20 alphavirus particles (Figure 26). Additionally, the Ross River capsid protein gene may 
be expressed alone from one DH construct, while the Sindbis glycoproteins are 
expressed from another, for use in packaging. Using this knowledge and the 
availability of several other alphaviruses from which to derive structural protein gene 
sequences, a large number of different protein combinations may be generated in similar 

25 approaches. 

Alternatively, the entire complement of structural protein genes from one 
alphavirus, or other members of the Togaviridae (e.g., rubella virus) may be used to 
package an RNA vector derived from another, as shown above for SFV vectors and 
Sindbis structural proteins. In an alternative embodiment, the structural protein genes 
30 from Venezuelan equine encephalitis (VEE) virus may be used to package a Sindbis- 
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virus derived vector (wild-type or displaying the phenotype described in Example 4 or 
5). Such a method provides recombinant alphavirus particles containing vector RNAs 
which exhibit the desirable properties of the present invention, such as delayed, reduced 
or no inhibition of host macromolecular synthesis, plus, structural proteins which 
5 redirect the tropism of the recombinant particle. Venezuelan equine encephalitis virus 
(VEE) is an alphavirus which exhibits tropism for cells of lymphoid origin, unlike its 
Sindbis virus counterpart. Therefore, Sindbis-derived vector constructs packaged by a 
cell line expressing the VEE structural proteins will display the same lymphotropic 
properties as the parental VEE virus from which the packaging cell structural protein 

10 gene cassette was obtained. 

Specifically, the Trinidad donkey strain of VEE virus (ATCC #VR-69) is 
propagated in BHK-21 cells, and virion RNA is extracted using procedures similar to 
those described in Example 1 . The entire structural protein coding region is amplified 
by PCR with a primer pair whose 5'-ends map, respectively, to the authentic AUG 

15 translational start site, including the surrounding Kozak consensus sequence, and the 
UGA translational stop site. The forward primer is complementary to VEE nucleotides 
7553-7579, and the reverse primer is complementary to VEE nucleotides 1 1206-1 1186 
(sequence from Kinney et al., Virology 770:19-30, 1989). PCR amplification of VEE 
cDNA corresponding to the structural protein genes is accomplished using a two-step 

20 reverse transcriptase-PCR protocol as described above, the VEE genome RNA as 
template, and the following oligonucleotide pair, which contain flanking Xho 1 and Not 
I sites: 

. . Forward primer: VEE 75S3F (5-rest. site/VEE capsid seq.^ (SEQ. ID. NO. 86) 
25 5 , -TATATATATCTCGAGACCGCCAAGATGTTCCCGTTCCAGCCA-3' 

Reverse primer: VEE 11206R (S'-rest site/VEE El elvcoseq.) (SEQ. ID. NO. 87) 
S-TATATATATGCGGCCGCTCAATTATGTTTCTGGTTGGT-S 1 
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Following PCR amplification, the approximately 3800 bp fragment is purified from a 
0.7% agarose gel using GENECLEAN II, and digested with Xho I and Not I. The 
resulting fragment is then ligated into the DNA-based Sindbis expression vector 
pDCMVSIN-luc (see above), that also has been digested with Xho I and Not I to remove 
5 its luciferase reporter gene insert, treated with alkaline phosphatase, and purified from a 
0.7% agarose gel using GENECLEAN II. The resulting VEE structural protein 
expression construct is designated pDCMV-VEEsp. Plasmid pDCMV-VEEsp 
subsequently is digested, under limiting partial digest conditions, with BspE I to remove 
most nonstructural protein gene sequences, and re-ligated to create the structural 

1 0 protein-expressing DH vector construct, pDCM V- VEEdl . 

Plasmid pDCMV-VEEdl, which also contains a neomycin resistance 
marker, is transfected into BHK cells using Lipofectamine, as described by the 
manufacturer. Approximately 24 hr post-transfection, the cells are trypsinized and re- 
plated in media containing 600ug/ml of the drug G418. The media is exchanged 

1 5 periodically with fresh G4 1 8-containing media and foci of resistant cells are allowed to 
grow. Cells are trypsinized and cloned by limiting dilution in 96 well tissue culture* 
dishes, and individual cell clones are grown and expanded for screening. Cells which 
inducibly express VEE structural proteins in response to input vector are identified by 
transfecting with Sindbis luciferase vector RNA, and assaying for VEE structural 

20 protein expression in cell lysates or packaged luciferase vector in the supematants, as 
described previously. Structural protein genes obtained from variants of VEE, or other 
alphaviruses and their variants differing in tissue tropism, also are useful when 
following this approach. In addition, each of the various structural protein gene 
expression cassette configurations described in this example may be used. 

25 

E- Production of packaged aiphavirus vectors from PCL 

Aiphavirus derived PCL described throughout this example may be used 
in a number of different ways to produce recombinant aiphavirus particle stocks, and 
include the introduction of vector by either transfection of RNA or DNA molecules, 
30 infection with previously produced packaged vector-containing particles, or the 
intracellular production of vector from stably transformed expression cassettes. The 
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utility of alphavirus PCL for the production of vector particles is demonstrated first with 
a reporter vector construct, and later may be applied to any other vector constructs 
which express a desired heterologous sequence. For example, a stock of packaged 
Sindbis-p-gal vector particles is obtained by electroporation of approximately 10 7 
5 alphavirus C/GLYCO or other PCL cells (see above, and for example, figures 15, 17, 
18) with 50 ug pKSSIN-l-BV-|3-gal or -luciferase RNA (Example 4) or pBG/SIN-1 
ELVS 1.5-P-gal or -luciferase DNA (Example 5), using the procedure described in 
(Liljestrom and Garoff, Bio/Technology 9:1356-1361, 1991). The transfected PCL are 
incubated at a desired temperature (e.g., 37°C), and at approximately 48 hr. post- 
10 transfection, the supernatants are harvested and clarified by passage through a 0.45 
micron filter. Additional formulation may be performed using parameters illustrated in 
the detailed description of this invention. 

Alternatively, a stock of packaged recombinant alphavirus particles 
(obtained using PCL as above, or by co-transfection of vector and DH constructs) is 
15 used to infect a naive culture of PCL, for further amplification. For example, 5xl0 7 of 
alphavirus C/GLYCO or other PCL cells are infected with a stock of packaged 
pKSSIN-l-BV-p-gal vector at an approximate multiplicity of infection (MOI) of 1 
infectious unit/cell. Upon reaching the cell cytoplasm, the particle delivered RNA 
vector is autocatalytically amplified and packaged into additional progeny particles. 
20 After incubation at the desired temperature (e.g., 37°C) for approximately 48 hr., the 
culture supernatants are harvested, clarified by passage through a 0.45 micron filter, and 
formulated as desired. 

In still another method which exploits the ability of PCL to further 
* amplify packaged recombinant alphavirus vector particles, stocks of packaged particles 
25 are used to infect naive cultures of PCL to create a working cell bank of vector- 
containing PCL (vector producing cells, Figure 27), which may subsequently be used to 
seed another naive culture of PCL. For example, such a working cell bank is obtained 
by infection of alphavirus C/GLYCO or other PCL cells with the packaged vector stock 
at a M.O.I.= 5. Approximately 2-3 hr post-infection, the vector containing PCL are 
30 gently detached and cell number is determined. The vector containing PCL may now 
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be used directly, or aliquoted and stored in liquid N 2 as a vector producing cell bank. 
When desired, the cells are seeded directly into a previously growing culture of naive 
alphavirus C/GLYCO or other PCL at a ratio of approximately 1 vector producing cell 
per 1000 fresh PCL, for production of large quantities of high titer packaged vector 
5 particles. Aliquots of culture supernatant are harvested at various times post-coculture 
to determine the time of maximal recombinant alphavirus particle production, and that 
time is chosen for further harvest, purification and formulation, as described above. 
The- same sequential amplification methodology using vector producing cells also is 
useful for large-scale production of any desired recombinant protein (Figure 27). For 

10 the production of recombinant protein, supernatants or cell lysates may be harvested, 
depending on the nature of the recombinant protein. 

In yet another method for producing high titer or large scale stocks of 
packaged recombinant alphavirus particles, the desired expression vector is introduced 
into 1-5% of a naive alphavirus PCL culture by transfection of in vitro transcribed RNA 

15 or plasmid DNA vector using a commonly accepted reagent or method (for example, 
Lipofectin or Lipofectamine, respectively, or infection with vector particles at low MOI 
[< 0.1]), as described herein. The recombinant vector particles produced by the initial 
cells, into which vector was introduced, subsequently infect other naive packaging cells 
in the culture, which in turn, produce even more packaged particles. This process of 

20 temporal amplification continues until packaged recombinant alphavirus particles are 
produced in all cells of the PCL culture. This amplification process is demonstrated in 
Figure 28. ELVS 1.5-P-gal plasmid DNA was transfected into 987DHBBNeo 
packaging cells or into BHK-21 cells, and the levels of P-galactosidase present in cell 
lysates was measured, as described previously, at the indicated times post-transfection. 

25 In BHK-21 cells, the level of p-galactosidase expression reached a maximum by 
approximately 48 hpt, and plateaued. In contrast, the level of p-galactosidase 
expression continued to increase over a longer period of time in the ELVS 1 .5-p-gal 
transfected 987DHBBNeo PCL culture, reflecting the recombinant vector particle 
amplification process, and the ultimate expression of p-galactosidase in all of the cells 

30 of the culture. In all cases, stocks of recombinant alphavirus vector particles may be 
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formulated so as to be pharmaceutical ly acceptable, using any of the methods described 
herein. 



5 EXAMPLE 7 

Construction of Alpha virus Producer Cell Lines 

The generation of alphavirus PCL, as described above, coupled with the 
construction of DNA-based alphavirus vectors exhibiting reduced, delayed, or no 

10 inhibition of host cell macromolecular synthesis (Examples U 2, 4 and 5), provides a 
relatively straightforward mechanism to derive alphavirus vector producer cell lines. In 
certain embodiments of the present invention, the vector producer cell lines contain one 
or more stably transformed structural protein gene expression cassettes, and also 
alphavirus RNA expression vector molecules with the above phenotype, that are 

15 transfected, transduced, or intracellular^ produced, leading to the production of 
packaged vector particles. In preferred embodiments, an RNA vector replicon 'is 
produced intracellular^ from a stably transformed DNA molecule (eukaryotic layered 
vector initiation system) that exists in either an integrated form or as an episomal DNA, 
with transcription of vector RNAs being controlled inducibly by one or more stimuli 

20 provided at a desired time. This type of alphavirus producer cell line configuration 
essentially provides a cascade of events that include: inducible production of vector 
RNA and resulting autocatalytic cytoplasmic amplification of the RNA, the induction of 
high level structural protein expression by vector-supplied nonstructural proteins, the 
^ packaging of vector RNA by the expressed structural proteins, and the release of 

25 packaged vector particles. Tightly regulated, inducible expression of vector RNA from 
the DNA molecule, once producer cell population reaches as desired number, is 
preferred, due to the potential for low level cytotoxicity of vector replication, or the 
necessity to control nonstructural protein synthesis, as it relates to the regulation of 
positive strand versus negative strand vector RNA ratios. 



30 
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A. Alphavinis DNA Vectors with Single Level Regulation 

In certain embodiments of the present invention, a DNA-based 
alphavinis vector is provided, wherein in vivo transcription of an alphavirus vector 
RNA molecule that is capable of autocatalytic amplification occurs from a promoter 

5 which is regulatable by applying a stimulus at a desired time. Such a DNA-based 
alphavirus vector subsequently may be stably transformed into an alphavirus packaging 
cell line (PCL) to create an inducible alphavirus producer cell line. The producer cell 
line configuration described herein, is therefore,- a "feed-forward" system in which: 1) a 
stimulus is applied to the cell, resulting in efficient transcription of alphavirus vector 

0 RNA; 2) the vector RNA replicates autocatalytically and produces nonstructural 
proteins; 3) the nonstructural proteins stimulate amplification of the structural protein 
expression cassette mRNAs and high level structural protein expression; and 4) the 
structural proteins interact with the vector RNA and result in the subsequent packaging 
of recombinant alphavirus particles which are released into the culture media. Any 

5 previously described alphavirus PCL, which is stably transformed with one or more 
inducible alphavirus structural protein expression cassettes, may serve as the parental 
line with which to derive the producer cell line. 

For example, a tetracycline-responsive promoter system (Gossen and 
Bujard, Proc. Natl. Acad. Sci 59:5547-5551, 1992) may be utilized for inducible 

0 transcription of an alphavirus vector RNA, as depicted in Figure 29. In this system, the 
expression of a tetracycline repressor and HSV-VP16 transactivator domain, as a 
"fusion" protein (rTA), stimulates in vivo transcription of the alphavirus vector RNA by 
binding specifically to a tetracycline operator sequence (tetO) located immediately 
* adjacent to a minimal "core" promoter (for example, CMV). The binding and 

5 transactivation event is reversibly blocked by the presence of tetracycline, and may be 
"turned on" by removing tetracycline from the culture media. As uninduced basal 
levels of transcription will vary among different cell types, other different minimal core 
promoters (for example HSV-tk) may be linked to the tetracycline operator sequences, 
provided the transcription start site is known, to allow juxtaposition at or in the 

0 immediate proximity of alphavirus vector nucleotide 1 . 
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The rTA transactivator is provided by an additional expression cassette 
also stably transformed into the same cell line; and in certain embodiments, the rTA 
expression cassette may itself be autoregulatory. The use of an autoregulatory rTA 
expression cassette circumvents potential toxicity problems associated with constitutive 
5 high level expression of rTA by linking expression to transcriptional control by the 
same tetO-linked promoter to which rTA itself binds. This type of system creates a 
negative feedback cycle that ensures very little rTA is produced in the presence of 
tetracycline, but becomes highly active when the tetracycline is removed (Figure 29). 
Such an autoregulatory rTA expression cassette is provided in plasmid pTet-tTAk 

10 (Shockett et aL, Proc. Natl Acad. ScL USA 92:6522-6526, 1995). 

Functionality of such a tetracycline-regulated DNA-based alphavirus 
vector is demonstrated by constructing a modified SIN- 1 -derived luciferase plasmid 
vector, which is driven by a tetracycline operator/CMV minimal promoter. Using 
plasmids pBG/SrN-1 ELVS1.5-luc (Example 4) and pBGSV3 t (Example 6) as starting 

15 material, an approximately 7200 bp fragment, including much of the SIN-1 
nonstructural-encoding region, the junction region promoter and luciferase reporter 
gene, and a portion of the 3 , -UTR, is isolated by digestion of pBG/SIN-1 ELVS1.5-luc 
with Bgl II and Fsp I, and purification from a 0.7% agarose gel using GENECLEAN II. 
The 7200 bp fragment is subsequently ligated into plasmid pBGSV3' that has also been 

20 digested with Bgl II and Fsp I, treated with alkaline phosphatase, and purified from a 
0.7% agarose gel using GENECLEAN II. The resulting construct is designated 
pBGSVdlB/SINl-luc. Insertion of the remaining sequences, which include the 
heptamerized tetracycline operator and minimal CMV promoter (tetO/CMV) linked to 
- Sindbis nucleotides 1-2289, such that transcription will initiate with one additional 

25 nonviral nucleotide 5* of Sindbis nucleotide 1, is accomplished by overlapping PCR. In 
PCR reaction #1, the approximately 370 bp tetO/CMV portion of the sequence is 
amplified by standard three-cycle PCR with a 30 second extension from template 
plasmid pUHC13-3 (Gossen and Bujard, ibid) using the following oligonucleotide 
primers that are designed to also contain flanking Bgl II and Asc I sites on one primer 

30 and sequences overlapping 5'-Sindbis nucleotides on the other. 
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Forward primer: 5'BAtet OF (5'-rest. sites/tetQ nts/> (SEQ. ID. NO. 88) 
S'-TATATAGATCTGGCGCGCC/TTTACCACTCCCTATCAGTGATAGO' 

5 Reverse primer: 3'CMVpr o/SINR (5-Sindbis nts /CMV nis ) (SEQ. ID. NO. 89) 
5'-TACGCCGTCAAT/ACGGTTCACTAAACGAGCTCTGC-3' 

In PCR reaction #2, the 2289 bp Sindbis 5*-end portion of the sequence is amplified by 
standard three-cycle PCR with a three minute extension, from template plasmid 
pKSRSIN-1 (Example 1), using the following oligonucleotide primers that are designed 
to also contain sequences overlapping the CMV promoter nucleotides on one primer. 

Forward primer: CMVSINS' endF (5'-CM\ nts./Sindhis nts ) (SEQ. ID. NO. 90) 
5'-TAGTGAACCGT/ATTGACGGCGTAGTACACACTATT 

Reverse pri mer: SIN2400R fall Sindbis nts/l (SEQ. ID. NO. 91) 
5'-CGTTGAGCATAACCGAATCTAC 

Following amplification, the DNA fragments are purified with QIAquick-spin and used 
together as templates in a subsequent three-cycle PCR reaction with 3.5 minute 
extension, using additional 5'BAtetOF and SIN2400R primers. The resulting 
overlapping PCR amplicon of approximately 2660 bp is purified using GENECLEAN 
II, digested with Bgl II, and ligated into plasmid pBGSVdlB/SINl-luc that has also 
. been digested with Bgl II, treated with alkaline phosphatase, and purified from a 0.7% 
agarose gel using GENECLEAN II. The resulting construct is designated ptetSINl-luc. 
Vector constructs containing other heterologous sequences-of-interest are generated 
using a similar approach, or by direct cloning into the Xho I and/or Not I sites. 
Subsequently, a selectable E. coli gpt gene (xanthine-guanine phosphoribosyltrans- 
ferase) expression cassette is generated and inserted into the unique Pac I site of 
plasmid ptetSINl-luc, to provide an additional selectable marker. First, a fragment 
containing the SV40 promoter linked to a gpt gene open reading frame is amplified 
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from plasmid pMAM (Clontech, Palo Alto, CA) by standard three-cycle PCR with a 2 
minute extension, and using the following oligonucleotide primers that are designed to 
contain upstream flanking Sac I and Pac I sites and a downstream Sac I site. 

5 Forward primer: SV4QproSPF (5'-rest. sites/SV40 promoter seq.l (SEQ. ID. NO. 92) 
S'-ATATAGAGCTCTTAATTAA/TCTTTGTGAAGGAACCTTACTTC 

Reverse primer: 3'ECgptR (5'-rest. site/gp; gene seq.) (SEQ. ID. NO. 93) 
5'-ATATAGAGCTC/AGGCGTTGAAAAGATTAGCGACCG 

10 

Following amplification, the SV40 promoter/gpf gene DNA fragment is purified with 
QIAquick-spin, digested with Sac /, purified using GENECLEAN II, and ligated into 
plasmid pBGS131 dUTioI-BGHTT (Example 5) that also had been digested with Sac /, 
treated with alkaline phosphatase, and purified from a 0.7% agarose gel using 

15 GENECLEAN II. Clones with proper orientation of the insert are identified by 
restriction analysis. This configuration positions the promoter and gpt gene 
immediately adjacent to a bovine growth hormone transcription termination signal. The 
resulting gpt expression construct is designated pBGS131 dlATroI-gpt. Next the entire 
expression cassette is amplified from plasmid pBGS131 dlATwI-gpt by standard three- 

20 cycle PCR with a 2 minute extension, and using the following oligonucleotide primers 
that are designed to contain flanking Pac I sites. 

Forward primer: SV4QproSPR as shown above (SEQ. ID. NO. 92) 

25 

Reverse primer: BGHTTpacR (S'-rest. site/BGH sea.) (SEQ. ID. NO. 94) 
5'-TATATATTAATTAA/ATAGAATGACACCTACTCAGACAATGCGATGC 

Following amplification, the gpt gene expression cassette fragment is purified with 
30 QIAquick-spin, digested with Pac I, purified using GENECLEAN II, and ligated into 
the tet-inducible alphavirus vector construct ptetSINl-luc that also had been digested 
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with Pac I, treated with alkaline phosphatase, and purified from a 0.7% agarose gel 
using GENECLEAN II. The resulting construct is designated ptetSINlgpt-luc. 

For construction of an initial tetracycline-inducible alphavirus vector 
producer cell line, the ptetSINlgpt-luc construct and a tetracycline repressor/VP16 
5 transactivator (rTA) expression cassette are stably transformed into the desired 
alphavirus PCL. For example, alphavirus C/GLYCO PCL cells (from above) are stably 
transformed with plasmid pTet-tTAk (see above) by cotransfection with another 
plasmid encoding a selectable marker. Plasmids pTet-tTAk and pSV2-His, encoding a 
histidinol dehydrogenase marker (Schatz et al., 1989, Cell 59:1035-1048), are co- 

10 transfected into C/GLYCO PCL cells (or other PCL) at a molar ratio of 40: K 
respectively, using Lipofectamine, as described by the manufacturer. Approximately 24 
hours post-transfection, the cells are trypsinized and re-plated in media containing 
histidinol and 0.5 ug/ml tetracycline. The media is exchanged periodically with fresh 
drug-containing media, and foci of resistant cells are allowed to grow. Cells are 

15 trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones are grown are expanded for screening. Positive pTet-tTAk- 
containing packaging cell clones, designated C/GLYCO/TAk cells, are identified by 
transfecting the luciferase reporter plasmid pUHC13-3 (Gossen and Bujard, ibid), under 
the control of a tetO/promoter, in both the presence or absence of tetracycline. In the 

20 absence of tetracycline, positive C/GLYCO/TAk PCL cells will provide induction from 
the tetO/promoter and inducible, high levels of luciferase. 

Subsequently, the DNA-based alphavirus vector construct ptetSINlgpt- 
luc is stably transfected into the C/GLYCO/TAk cells using Lipofectamine, as 
described by the manufacturer. Approximately 24 hr post-transfection, the cells are 

25 trypsinized and re-plated in selection media, optimized for the particular cell type 
(DMEM + 10% dialyzed fetal calf serum; 250 ug/ml xanthine; 15 ug/ml hypoxanthine; 
10 ug/ml thymidine; 2 ug/ml aminopterin; 25 ug/ml mycophenolic acid), and containing 
0:5 ug/ml tetracycline. The media is exchanged periodically with fresh selection media, 
and foci of resistant cells are allowed to grow. Cells are trypsinized and cloned by 

30 limiting dilution in 96 well tissue culture dishes, and individual cell clones are grown 
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are expanded for screening. Positive producer cell lines, stably transformed with 
ptetSINlgpt-luc, are identified by removing tetracycline from the media for at least 24 
hr and testing for luciferase in cell lysates and also testing for packaged luciferase 
vector in the culture supernatants, as described previously. 

5 

B. Alphavirus DNA Vectors With Two Level Regulation 

In preferred embodiments, it may be desirable to construct a DNA-based 
alphavirus vector (wild-type or with the desired phenotype of reduced, delayed or no 
inhibition of host macromolecular synthesis), wherein transcription of the RNA vector 

10 molecule, capable of autocatalytic amplification, occurs from a promoter which is very 
tightly controlled by two levels of regulation to eliminate all basal levels of 
transcription. Such an approach may combine one inducible component (e.g., the tet 
system from above) with a reversible transcriptional silencing component. For 
example, the KRAB repression domain of a certain zinc finger protein may be used. 

15 Briefly, KRAB (Kriippel-associated box) domains are highly conserved 

sequences present in the amino-terminal regions of more than one-third of all Kriippel- 
class Cys2His2 zinc finger proteins. The domains contain two predicted amphipathic 
ct-helicies and have been shown to function as DNA binding-dependent RNA 
polymerase II transcriptional repressors (for example, Licht et al., Nature 346: 76-79, 

20 1990). Like other transcription factors, the active repression domain and the DNA- 
binding domain are distinct and separable. Therefore, the repression domain can be 
linked as a fusion protein to any sequence specific DNA binding protein for targeting. 
Ideally, the DNA binding protein component can be reversibly prevented from binding 
. in a regulatable fashion, thus turning "off" the transcriptional silencing. For example, 

25 within one embodiment the KRAB domain from human Koxl (Thiesen, New BioL 
2:363-374, 1990) is fused to the DNA-binding lactose (lac) repressor protein, forming a 
hybrid transcriptional silencer with reversible, sequence-specific binding to a lac 
operator sequence engineered immediately adjacent to the tet-responsive promoter 
(Figure 30). In this configuration, constitutive expression of the lac repressor/KRAB 

30 domain fusion (rKR) will result in binding to the lac operator sequence and the 
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elimination of any "leaky" basal transcription from the uninduced tet-responsive 
promoter. When vector expression is desired and tetracycline is removed from the 
system, IPTG is added to prevent rKR-mediated transcriptional silencing. 

In addition, the KRAB domains from other zinc finger proteins, for 
5 example, ZNF133 (Tommerup et al., Hum. Mol Genet 2:1571-1575, 1993), ZNF91 
(Bellefroid et al., EMBO J. 72:1363-1374, 1993), ZNF2 (Rosati et al., Nucleic Acids 
Res. 79:5661-5667, 1991), and others, as well as other transferable repressor domains, 
for example, Drosophila en or eve genes (Jaynes and O'Farrell, EMBO J. 70:1427-1433, 
1991; Han and Manley, Genes Dev. 7:491-503, 1993), human zinc finger protein YY1 
0 (Shi et al., Cell 57:377-388, 1991), Wilms' tumor suppressor protein WT1 (Madden et 
al, Science 255:1550-1553, 199!), thyroid hormone receptor (Baniahmad et al., EMBO 
J. 77:1015-1023, 1992), retinoic acid receptor (Baniahmad et al., ibid). Kid-1 (Witzgall 
et al., Proc. Natl Acad Set USA 97:4514-4518, 1994), are readily used in such a 
system. Furthermore, the lac repressor/lac operator component of this system may be 
substituted by any number of other regulatable systems derived from other sources, for 
example, the tryptophan and maltose operons, GAL4, etc. 

Specifically, an expression cassette that contains the lac repressor (lacl) 
protein fused to the KRAB domain of human Koxl, with a linked nuclear localization 
sequence (NLS; Pro-Lys-Lys-Lys-Arg-Lys (SEQ. ID. NO. 100); Kalderon et al., Cell 
39:499-509, 1984) to more efficiently direct the protein back to the nucleus, is 
constructed by overlapping PCR. In PCR reaction #1, the approximately 1 100 bp lacl 
sequence is amplified by standard three-cycle PCR with a 1.5 minute extension, from 
template plasmid p3'SS (Stratagene, La Jolla, CA), using the following oligonucleotide 
primers that are designed to also contain a flanking Xho I site and AUG start codon in 
good translation initiation context on the upstream primer, and the SV40 large-T- 
antigen nuclear localization sequence on the other. 

Forward pri mer LacIS'F (S'-rest. site/AUG + lacl sequence) (SEQ. ID. NO. 95) 
5-ATATACTCGAGTAGCA/ATGGTGAAACCAGTAACGTTATAC 
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Reverse primer: Lac73'NLSR (5'-NLS/lacI sequence) (SEQ. ID. NO. 96) 
S'-GCCCTTTCTCTTC l 1 I I I TGG/CTGCCCGCTTTCCAGTCGGGAAAC 



In PCR reaction #2, the an approximately 400 bp amplicon, comprising the amino- 
5 terminal 121 residue KRAB domain of human Koxl is amplified by standard three- 
cycle PCR with a one minute extension, from template plasrnid pKoxl (Thiesen, New 
Biol 2:363-374, 1990), using the following oligonucleotide primers that are designed to 
also contain sequences overlapping NLS and lad on one primer and a Sac I restriction 
site and stop codon on the other. 

10 

Forward primer: KRAB5T (S'-NLS+lacl overlap sequence/KRAB sequence) 
(SEQ. ID. NO. 97) 

S'-CCAAAAAAGAAGAGAAAG/GGCGGTGGTGCTTTGTCTCCT 



15 Reverse primer: KRAB3'R (5-rest. site+stop codon/KRAB sequence) 
(SEQ. ID. NO. 98) 

S^ATATAGAGCTCTTA/AACTGATGATTTGATTTCAAATGC 

Following amplification, the DNA fragments are purified with QIAquick-spin and used 
20 together as templates in a subsequent three-cycle PCR reaction with 2.5 minute 
extension, using additional LacI5*F and KRAB3'R primers. The resulting overlapping 
PCR amplicon of approximately 1500 bp is purified using GENECLEAN II, digested 
with Xho I and Sac /, and ligated into the eukaryotic expression vector plasrnid pEUK- 
- CI (Clontech, Palo Alto, CA) that has also been digested with Xho I and Sac h treated 
25 with alkaline phosphatase, and purified from a 0.7% agarose gel using GENECLEAN 
II. The resulting lacI/KRAB expression construct is designated pEUK-rKR. 

To generate stable PCL transformants containing the /ac//KRAB 
expression cassette, an alphavirus PCL which has already been selected for 
transformation with an rTA tet/transactivator fusion protein cassette is used for starting 
30 material. For example, alphavirus C/GLYCO/TAk PCL cells (from above) are stably 
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transformed with plasmid pEUK-rKR by cotransfection with another plasmid encoding 
a selectable marker. Plasmids pEUK-rKR and pPUR, encoding a puromycin 
acetyltransferase selectable marker (Clontech), are co-transfected into C/GLYCO/TAk 
PCL cells (or other PCL) at a molar ratio of 40:1, respectively, using Lipofectamine, as 
5 described by the manufacturer. Approximately 24 hr post-transfection, the cells are 
trypsinized and re-plated in media containing 5 ug/ml puromycin and 0.5 ug/ml 
tetracycline. The media is exchanged periodically with fresh drug-containing media, 
and foci of resistant cells are allowed to grow. Cells are trypsinized and cloned by 
limiting dilution in 96 well tissue culture dishes, and individual cell clones are grown 

10 are expanded for screening. Positive pEUK-rKR-containing packaging cell clones, 
designated C/G/TAk/rKR cells, are identified by immunostaining with a polyclonal 
antiserum specific for lacl (Stratagene, La Jolla, CA). 

Next, specific lac operator (lacO) sequences must be inserted into the 
desired ptet-based alphavirus vector (see above). For example, vector construct 

15 ptetSINlgpt-luc is modified to contain multiple copies of lacO by using a synthetic 
oligonucleotide linker. The LacO oligonucleotide is designed to contain a symmetric 
lacO sequence, including the full 22 bp palindromic operator sequence (Simons et al., 
Proc. Natl. Acad. Sci. USA 57:1624-1628, 1984; Sadler et al., Proc. Natl. Acad. Sci. 
USA 50:6785-6789, 1983), and flanking Asc I sites when self-annealed into a double- 

20 stranded molecule. 

LacOsvmA (SEQ. ID. NO. 99) 
5'-CGCGCCGAATTGTGAGCGCTCACAATTCGG 

25 The LacOsymA oligo is sclf-annealed to form a Asc I "sticky-ended" DNA fragment, 
and then ligated into plasmid ptetSINlgpt-luc that has been digested with Asc I, treated 
with alkaline phosphatase, and purified from a 0.7% gel using GENECLEAN II. 
Clones containing one, two, three, or more tandem copies of the lacO sequence are 
identified by sequence analysis, and given the designation pOItetSINlgpt-luc, 

30 pOIItetSINlgpt-luc, pOIIItetSINlgpt-luc, etc. Individual clones with different lacO 
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copy numbers are then transfected as detailed below, and tested for the tightest level of 
transcriptional regulation. 

To generate an alphavirus vector producer cell line, the. DNA-based 
pOtetSINlgpt-luc vector constructs are stably transfected into C/G/TAk/rKR cells using 
5 Lipofectamine, as described by the manufacturer. Approximately 24 hr post- 
transfection, the cells are trypsinized and re-plated in selection media, optimized for the 
particular cell type (DMEM + 10% dialyzed fetal calf serum; 250 ug/ml xanthine; 15 
ug/ml hypoxanthine; 10 ug/ml thymidine; 2 ug/ml aminopterin; 25 ug/ml mycophenolic 
acid), and containing 0.5 ug/ml tetracycline. The media is exchanged periodically with 

10 fresh selection media, and foci of resistant cells are allowed to grow. Cells are 
trypsinized and cloned by limiting dilution in 96 well tissue culture dishes, and 
individual cell clones are grown are expanded for screening. Positive producer cell 
lines, stably transformed with the pOtetSINlgpt-luc constructs, are identified by the 
expression of luciferase (described previously) at least 24 hr after the removal of 

15 tetracycline from the media and the addition of 20mM IPTG for induction. Luciferase 
activity is determined both on producer cell lysates and also after transfer-of-expression 
experiments using culture supernatants. 

Additional levels of control may be incorporated by adding a third, or 
even fourth, level of regulation to the promoter responsible for transcription of the 

20 alphavirus vector molecule. Such extra level or regulation may be incorporated into the 
minimal promoter, and may involve other inducible systems and/or cell differentiation 
control. In each of the above cases, stable transformation may be accomplished as an 
integration into the host cell chromosome, or as an extrachromosomal episome, using 
for example, the EBV episomal-based vector promoter (for non-integrated). 
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EXAMPLE 8 

Methods for the Generation of Alphavirus-Deri ved Empty 
or Chimeric Viral Particles 

5 As illustrated in Example 6, individual defective helper (DH) expression 

cassettes can be constructed to contain elements from multiple alphaviruses or their 
variants. Thus, as described in Example 6, split structural gene DH cassettes for the 
expression of the viral glycoproteins can be constructed to contain the capsid and 
glycoprotein genes from different alphavirus species. For example, such a heterologous 

10 alphavirus glycoprotein DH cassette might contain the capsid gene from Ross River 
virus (RRV), and the glycoprotein genes from Sindbis virus. In this configuration, the 
RRV capsid gene serves to enhance the level of translation of the glycoprotein genes. 

The configurations described herein for the heterologous alphavirus 
glycoprotein DH cassettes are designed to improve the packaging of vector replicons 

15 into alphavirus particles, yet diminish the possibility of recombination, resulting in the 
formation of replication competent alphavirus. The heterologous alphavirus 
glycoprotein DH expression cassette is a replacement of the Sindbis virus capsid gene 
in the DH expression cassettes described in Example 6 ("genomic" structural protein 
gene PCL), with a heterologous alphavirus capsid gene (e.g. RRV). The second DH 

20 expression cassette in the split structural gene PCL contains, for example, the Sindbis 
virus capsid gene. Thus, a split structural gene PCL for the generation of recombinant 
alphavirus vector particles having Sindbis virus structural proteins can be derived, for 
example, with the Sindbis virus glycoprotein genes and capsid genes on individual DH 
expression cassettes. 

25 It has been shown previously that chimeric viruses containing all of the 

genes of RRV, but with the capsid gene from Sindbis virus, or the reciprocal chimeric 
virus, do not assemble into infectious virus particles (Lopez et. al M J. Virol 65:1316- 
1323, 1994). The authors concluded in this report that the interaction between the 
carboxy terminus of glycoprotein E2 and capsid protein in virus assembly cannot occur 

30 between the structural proteins of heterologous alphaviruses. Thus, recombinant 
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genomes arising in the split structural gene alphavirus PCLs described in Example 6, 
consisting of the Sindbis virus non-structural protein genes (originating from the vector 
replicon), the RRV capsid gene, and the Sindbis virus glycoprotein genes, should not be 
replication competent, resulting in the propagation of virus (replication competent 
5 Sindbis virus, RCSV). The packaging restriction between heterologous alphavirus 
species permits the construction of DH cassettes comprised of the capsid gene, 
including the translational enhancement element, from one alphavirus, and the 
glycoprotein genes from a different alphavirus.- 

However, as illustrated in Figure 31, the observation of Lopez et. al. 

10 (ibid), that assembly cannot occur between the structural proteins of heterologous 
alphaviruses, is incorrect. Indeed, a DH cassette consisting of the RRV capsid gene, 
and the Sindbis virus glycoprotein gene produces infectious virus particles. Briefly, 
BHK cells were co-electroporated with SINrep/Lac Z replicon (Bredenbeek et. al., J. 
Virol, 67:6439-6446, 1993), and DH-BB (5* tRN A/SIN) Crrv (Example 6 and Figure 

15 24; RRV capsid/Sindbis virus glycoproteins) in vitro transcribed RNAs. The 
electroporation and in vitro transcriptions were performed as described in Example* 1 . 
Following electroporation, the BHK cells were treated with dactinomycin and labeled 
with [ 3 H]uridine, exactly as described in Example 1 . At 1 8 hours post electroporation, 
the culture medium was collected, and clarified by centrifiigation at 6,000 rpm for 10 

20 min. The vector particles remaining in the supernatant were pelleted by 
ultracentrifugation, after first layering over a sucrose cushion. RNA was isolated from 
the BHK cells at 18 hours post electroporation, and from the virus pellet, 
electrophoresed on denaturing glyoxal agarose gels, and visualized by autoradiography, 
- exactly as described in Example 1. The viral RNAs present in BHK cells electoporated 

25 with SINrep/LacZ and DH-BB Crrv RNAs, and in virus particles, are shown in Figure 
31 (lane 1, panel A, and lane 1, panel B). RNAs corresponding to the genomic and 
subgenomic replicative species for SINrep/LacZ and DH-BB Crrv RNAs were present 
in both electroporated BHK cells, and the produced virus particles. The results 
demonstrate, in contrast to Lopez et al. (ibid), the formation of chimeric alphavirus 

30 particles consisting of RRV capsid protein and Sindbis virus glycoproteins. Further, the 
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indiscriminate packaging of genomic and subgenomic SINrep/LacZ and DH-BB Crrv 
RNAs in chimeric alphavirus particles indicates the inability of the Ross River capsid 
protein to recognize specifically the Sindbis virus packaging sequence, which is present 
in the nsPl gene of the SINrep/LacZ vector replicon. 

5 The viral proteins present in BHK cells electoporated with SINrep/LacZ 

and DH-BB Crrv RNAs, at 18 hours post electroporation, and the produced virus 
particles are given in Figure 32 (lane 1, panel A, and lane 1, panel B). The viral-specific 
structural proteins in electroporated cells and the produced chimeric alphavirus particles 
were indistinguishable. That is, Figure 32 demonstrates clearly that virus particles 

0 produced from BHK cells electroporated with SINrep/LacZ and DH-BB Crrv RNAs 
contained the RRV capsid and the Sindbis virus glycoproteins El and E2. This result 
provides indisputable evidence that in contrast to Lopez et. al„ (ibid), there is no 
restriction in assembly between heterologous alphavirus capsid and glycoproteins that 
prevents the formation of chimeric viral particles. 

5 Thus, in distinct contrast to the results and discussion of Lopez et. al., the 

amino terminus of the RRV capsid protein is able to bind with the heterologous Sindbis 
virus genome, and form infectious chimeric alphavirus particles. Importantly, the 
previous conclusion that there is a restriction of virus assembly between heterologous 
alphavirus capsid proteins and glycoproteins is incorrect. The generation of chimeric 

0 alphavirus particles as described here would, then, also result in the formation of RCSV 
in the split structural gene PCLs described above, since a recombinant genome 
consisting of the Sindbis virus non-structural protein genes (originating from the vector 
replicon), the RRV capsid gene, and the Sindbis virus glycoprotein genes, would 
* generate infectious virus. Alternatively, this lack of restriction of packaging between 

5 distinct alphavirus structural proteins and vector replicons permits the tropism of vector 
particles to be modified. For example, Sindbis virus replicons can be packaged with the 
Venezuelan Equine Ecephalitis virus structural proteins, in order to generate a 
lymphotropic recombinant vector particle. 

The results described in two separate previous investigations have shown 

0 that ablation, in vitro, of the interaction between the capsid protein and the positive 
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RNA-stranded genome of two icosahedral viruses having triangulation numbers (T)=3, 
turnip crinkle virus (TCV), and southern bean mosaic virus (SBMV), resulted in the 
disassociation of the virus particles, and the formation of nucleic acid-free T=l particles 
(Sorger et. al. J. Mol. Biol., 191:639-656, 1986, and Erickson and Rossmann, Virology 
5 1 16:128-136, 1982). In the absence of nucleic acid, T=3 particles similar to wild-type 
virus were not formed in vitro, Owen and Kuhn (J. Virol., 70:2757-2763, 1996), 
investigated the packaging properties of Sindbis virus genomes containing deletions in 
the capsid, in order to identify the region of the capsid protein that is required for 
dictating specificity of the encapsidation reaction, in vivo. One mutant virus [CD(97- 

10 106)] which contained a deletion corresponding to residues 97-106 of the capsid, 
encapsidated both genomic and subgenomic RNAs, indicating the domain of the capsid 
protein required for specific recognition of the genomic RNA packaging signal. In yet 
another report, the packaging properties of Aura alphavirus were investigated 
(Rumenapf et. al. J. Virol., 69:1741-1746, 1995). In this study, a mechanism for 

1 5 alphavirus packaging that involves a capsid protein-encapsidation sequence interaction 
initiation complex was proposed. This mechanism proposed is based on observations 
by the authors, and others (including Owen and Kuhn, ibid), in which 26S and 49S 
alphavirus RNAs are packaged into T=l, T=3, T=4, and T=7 virus particles, and that 
empty capsids arising during infection with alphaviruses have not been reported. 

20 Based on the literature presented above and the discussions contained 

therein, a RRV capsid gene deleted of the region corresponding to the capsid protein 
domain that is required for dictating specificity of the encapsidation reaction, and, in 
addition, surrounding basic residues that bind electrostatically with viral RNA, should 
not be able to form stable capsid particles containing viral RNAs. Thus, the alphavirus 

25 structural proteins expressed from a heterologous alphavirus DH cassette, consisting of 
this deleted RRV capsid gene and the Sindbis virus glycoprotein genes, should not 
assemble into stable chimeric alphavirus particles. Thus, in the split structural gene 
PCL discussed above and in Example 6, a recombinant genome consisting of the 
Sindbis virus non-structural protein genes, the RRV capsid gene (deleted of the region 

30 corresponding to packaging specificity and the surrounding basic residues), and the 
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Sindbis virus glycoprotein genes, could not generate infectious virus. As described in 
Example 6, the Sindbis virus capsid protein is expressed from a separate DH expression 
cassette; thus, the three Sindbis virus structural proteins are expressed in toto, resulting 
in the production of recombinant vector particles. 
5 Nucleotides of the RRV capsid gene corresponding to predicted regions 

of the expressed protein that bind to the Sindbis virus packaging sequence (Weiss et. al., 
Nuc. Acids, Res., 22:780-786, 1994, and Lopez et. al., ibid), including the basic 
residues which bind electrostatically with the viral RNA, were deleted in order to 
construct a heterologous alphavirus capsid-glycoprotein DH that provided translation^ 

10 enhancement and correct pE2-6K-El polyprotein processing by post-translational 
cleavage, yet could not assemble stable chimeric RRV/Sindbis virus particles. Figure 
33 illustrates the hydrophobicity profiles (Kyte-Dolittle) of the RRV capsid protein, and 
the capsid protein expressed from 3 individual RRV capsid gene mutants (CAlrrv. 
CA2rrv, and CA3rrv), in which varying amounts of the capsid gene encoding a lysine- 

15 rich protein that interacts with the viral packaging sequence RNA. was deleted. The 
lysine-rich basic region of the RRV capsid protein is shown in Figure 33. Further, the 
hydrophobicity profiles demonstrate that this lysine-rich basic region is progressively 
eliminated in the 3 individual RRV capsid gene mutants CAlrrv, CA2rrv, and CA3rrv. 
Figure 34 demonstrates the lysine residues eliminated in the expressed RRV capsid 
20 protein, as a result of the deletions in mutants CAlrrv, CA2rrv, and CA3rrv. The table 
shown below gives the nucleotides deleted in the RRV genome of constructs CAlrrv, 
CA2rrv, and CA3rrv. 



Construct Deleted RRV genome nts. 
CAlrrv 7841-7891 
CA2rrv 7796-7891 
CA3rrv 7760-7891 

25 The RRV capsid gene deletions were constructed on the DH-BB Crrv plasmid DNA 
illustrated in Figure 23, and described in Example 6. The indicated RRV capsid gene 
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sequences were deleted by PCR, using the primers and other cloning steps given in 
Example 6. 

Figures 31 and 32, discussed above, illustrate the virus-specific RNAs 
(Figure 31) and proteins (Figure 32) synthesized in BHK cells electroporated with 
5 SINrep/LacZ and the DH-BB CAlrrv, CA2rrv, or CA3rrv RNAs, and present in viral 
particles contained in the culture fluids of these cells. The genomic and subgenomic 
species were detected for both the SINrep/LacZ replicon and all the three DH-BB 
CAlrrv, CA2rrv, or CA3rrv DH RNAs in electroporated cells (Figure 31, panel A). 
However, the SINrep/lacZ replicon was not packaged in vector particles in cells 

10 electroporated with DH RNA containing deletions in the RRV capsid gene, as 
demonstrated by the absence of replicon genomic RNA in virus particles (Figure 3 1 , 
panel B). Further, helper genomic and subgenomic RNAs were packaged very 
inefficiently and were barely visible in autoradiograms of denaturing gels (Figure 31, 
panel B, lanes 3 and 4), when cells were electroporated with DH molecules containing 

1 5 larger deletions of the RRV capsid (CA2rrv or CA3rrv). In contrast, while SINrep/lacZ 
genomic RNA (and DH RNA in electroporations with CA2rrv or CA3rrv) was not 
detected in viral particles from BHK cells electroporated with the DHs containing 
deletions in the RRV capsid gene, equivalent RRV capsid protein and Sindbis virus 
glycoprotein levels were observed in virus particles from cells electroporated with all 

20 DH RNAs, regardless of whether the RRV capsid gene contained deletions (Figure 32, 
panels A and B). This result demonstrates that stable chimeric virus particles not 
containing vector replicon, or other viral-specific RNAs, were formed in BHK cells 
electroporated with SINrep/lacZ genomic RNA and DH RNA, from which capsid 
protein unable to bind to the genomic RNA was expressed. The formation of stable 

25 empty heterologous alphavirus particles is unexpected and not predicted, based on the 
results and discussions of previous investigations (Lopez et. al., J, Virol 65:1316-1323, 
1994, Sorger et. al. 1 Mol Biol, 797:639-656, 1986, Erickson and Rossmann, Virology 
776:128-136, 1982, and Rumenapfet al. J. ViroL 69:1741-1746, 1995). 

To determine the composition of the virus particles produced, BHK cells 

30 were electroporated with SINrep lacZ and the DH RNAs containing various RRV 
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capsid gene configurations, as described herein, and were treated subsequently with 
dactinomycin, and labeled with [ 35 S]methionine and [ 3 H]uridine, as described in 
Example 1. The configuration of the viral particles produced were determined by 
ultracentrifugation of clarified cell culture media for 2 hrs at 35,000 rpm in a SW-41 
5 rotor over a 20%-40% (w/w) sucrose gradient. The results of this study are shown in 
Figures 35-37, and demonstrate again the formation of stable empty heterologous 
alphavirus particles. Figure 35 demonstrates the relative levels of ["SJmethionine and 
[ 3 H]uridine incorporated into particles synthesized in BHK cells infected at high MOI 
(5) with wild-type virus, TotollOl. Figure 36 demonstrates that the relative levels of 

10 [ 35 S]methionine and [ 3 H]uridine incorporated into particles synthesized in BHK cells 
electroporated with SINrep/LacZ and DH-BB (5' tRNA) Crrv (Figure 23) RNAs was 
the same as in cells infected with wild-type virus. In contrast, Figure 37 demonstrates 
that the particles produced in BHK cells electroporated with SINrep/LacZ and DH-BB 
(5 1 tRNA) CA3rrv contained very low levels of incorporated [ 3 H]uridine. Figure 38 is a 

15 compilation of Figures 35-37, and illustrates clearly that while the relative levels of 
[ 35 S]methionine and [ 3 H]uridine incorporated into particles were similar in BHK cells 
infected or electroporated with RNAs containing wild-type alphavirus capsid genes, 
BHK cells electroporated with DH RNA containing deletions of nts. 7760-7891 of the 
RRV capsid gene formed stable chimeric empty alphavirus particles, devoid of 

20 SINrep/LacZ RNA. Titers of empty alphavirus particles, produced in cell lines 
electroporated with with SINrep/LacZ RNA and DH-BB CD3rrv in vitro transcribed 
RNAs, and labeled with ["SJmethionine, were determined by comparison with BHK 
cells, infected with Toto 1 101 wild-type virus, and labeled with [35S]methionine. The 
• level of radioactivity present in virus-containing sucrose gradient fractions from Toto 

25 1 101 -infected cells was quanitated, and related to the virus titer present in these same 
fractions, as determined by plaque assay, according to the methods described in 
Example 1. For empty alphavirus particle titer determinations, the level of radioactivity 
present in vims particle-containing sucrose gradient fractions from BHK cells 
electroporated with SINrep/LacZ RNA and DH-BB CD3rrv in vitro transcribed RNAs, 

30 was quantitated, and related to the [ 35 S]methionine/virus titer, from Toto 1101 infected 
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cells. The titer of empty chimeric virus particles, containing the deleted Ross River 
virus capsid and the Sindbis virus glycoproteins, produced in SINrep/LacZ RNA and 
DH-BB CD3rrv in vitro transcribed RNA electroporated cells was 1 x 10* particles/ml. 

5 While the present invention has been described above both generally and 

in terms of preferred embodiments, it is understood that variations and modifications 
will occur to those skilled in the art in light of the description, supra. Therefore, it is 
intended that the appended claims cover all such variations coming within the scope of 
the invention as claimed. 

10 Additionally, the publications and other materials cited to illuminate the 

background of the invention, and in particular, to provide additional details concerning 
its practice as described in the detailed description and examples, are hereby 
incorporated by reference in their entirety. 
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SEQUENCE LISTING 



(1) GENERA. INFORWTION: 

(i) APPLICANTS: Chiron Viagene. Inc. 

Washington University 

(ii) TITLE OF INVENTION: RECOMBINANT ALPHAVIRUS-BASEQ VECTORS 

WITH REDUCED INHIBITION OF CELLULAR hV^WOLECULAR 
SYNTHESIS 

(in) NUMBER OF SECUENCES: 118 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center. 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC coipatible 

(C) OPERATING SYSTEM: PC-DGS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 04-APR-1997 
• (0 CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: McMasters. David D. 

(B) REGISTRATION NUMBER: 33.963 

(C) REFERENCE/DOCKET NUMBER: 930049. 457PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : (206) 622-4900 

(B) TELEFAX: (206) 682-6031 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE OHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATC1CTAGGG TC£TOCTAAA TAGT 24 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
GGTGGASCTC TMTACGACT CACTATAGAT TGACQGCGTA GHACACAC 48 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 3 : 
MTTTCTGCC TCAGC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHfiRACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TATGCAAAGT TACTGAC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

CTGTCATTAC TTCATGTC 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
•(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
G03TGGATCA CTTTC 15 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATTGOGTGAT TTCGTCCGT 19 
(2) INFORMATICN FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TAWTTGAG CTTTG 15 
(2) INFORMATION FOR SEQ ID N0:9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(0 STRANDEDNESS: single 
(D) "TOPOLOGY: linear' 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
QQCATATCQC ATOGTTG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTGGCCATQG AAGGAAAGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCCCTCGAQG GTTTTTTTTT 1111111111 TTGAAATGTT AAAAACAAAA TTTTGTTG 
(2) INFORMATION FOR SEQ ID NO: 12: 



58 
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(i) SEQUENCE CHAR/OERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TATATQGGCC CGATTTfiQGT GACACTATAG ATTGAOGSCG TAGTACAC 48 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTGGCAACCG GTAAGTACGA TAG 23 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE QWRACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

* " (0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
ATACTAGCCA OGGCCGGTAT C 



item 



21 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCCTCTTTCG ADGTGTCGAG C 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
/CCTTGfiAGC GCAATGTCCT G 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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SNBS™-Luciferase Vector Packaging by 
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E2 transfection 
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CTGTTCCAAA TGTGCCACAG ATACCGTATA AGGCACTTGT TGAAAGGGCA GGGTATGCCC 10140 

CGCTCAATTT GGAGATCACT GTCATGTCCT CGGAGGTTTT GCCTTCCACC AACCAAGAGT 10200 

ACATTACCTG CAAATTCACC ACTGTGGTCC CCTCCCCAAA AATCAAATGC TGCGGCTCCT 10260 

TGGAATGTCA GCCGGCCGCT CATGCAGACT ATACCTGCAA GGTCTTCGGA GGGGTCTACC 10320 

CCTTTATGTG GGGAGGAGCG CAATGTTTTT GCGACAGTGA GAACAGCCAG ATGAGTGAGG 10380 

CGTACGTCGA ATTGTCAGCA GATTGCGCGT CTGACCACGC GCAGGCGATT AAGGTGCACA 10440 

CTGCCGCGAT GAAAGTAGGA CTGCGTATTG TGTACGGGAA CACTACCAGT TTCCTAGATG 10500 

TGTACGTGAA CGGAGTCACA CCAGGAACGT CTAAAGACTT GAAAGTCATA GCTGGACCAA 1 0560 

TTTCAGCATC GTTTACGCCA TTCGATCATA AGGTCGTTAT CCATCGCGGC CTGGTGTACA 10620 

ACTATGACTT CCCGGAATAT GGAGCGATGA AACCAGGAGC GTTTGGAGAC ATTCAAGCTA 10680 

CCTCCTTGAC TAGCAAGGAT CTCATCGCCA GCACAGACAT TAGGCTACTC AAGCCTTCCG 10740 

CCAAGAACGT GCATGTCCCG TACACGCAGG CCTCATCAGG ATTTGAGATG TGGAAAAACA 10800 

ACTCAGGCCG CCCACTGCAG GAAACCGCAC CTTTCGGGTG TAAGATTGCA GTAAATCCGC 10860 

TCCGAGCGGT GGACTGTTCA TACGGGAACA TTCCCATTTC TATTGACATC CCGAACGCTG 10920 

CCTTTATCAG GACATCAGAT GCACCACTGG TCTCAACAGT CAAATGTGAA GTCAGTGAGT 10980 

GCACTTATTC AGCAGACTTC GGCGGGATGG CCACCCTGCA GTATGTATCC GACCGCGAAG 11040 

GTCAATGCCC CGTACATTCG CATTCGAGCA CAGCAACTCT CCAAGAGTCG ACAGTACATG 1 1 1 00 

TCCTGGAGAA AGGAGCGGTG ACAGTACACT TTAGCACCGC GAGTCCACAG GCGAACTTTA 11160 

TCGTATCGCT GTGTGGGAAG AAGACAACAT GCAATGCAGA ATGTAAACCA CCAGCTGACC 11220 

ATATCGTGAG CACCCCGCAC AAAAATGACC AAGAATTTCA AGCCGCCATC TCAAAAACAT 11280 

CATGGAGTTG GCTGTTTGCC CTTTTCGGCG GCGCCTCGTC GCTATTAATT ATAGGACTTA 11340 

TGATTTTTGC TTGCAGCATG ATGCTGACTA GCACACGAAG ATGACCGCTA CGCCCCAATG 11400 

ATCCGACCAG CAAAACTCGA TGTACTTCCG AGGAACTGAT GTGCATAATG CATCAGGCTG 11460 

GTACATTAGA TCCCCGCTTA CCGCGGGCAA TATAGCAACA CTAAAAACTC GATGTACTTC 11520 

CGAGGAAGCG CAGTGCATAA TGCTGCGCAG TGTTGCCACA TAACCACTAT ATTAACCATT 11580 

TATCTAGCGG ACGCCAAAAA CTCAATGTAT TTCTGAGGAA GCGTGGTGCA TAATGCCACG 11640 

CAGCGTCTGC ATAACTTTTA TTATTTCTTT TATTAATCAA CAAAATTTTG TTTTTAACAT 11700 

TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 11740 



Fig. 8E 
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TCAGAGGGGA AATAAAGCAT CTCTACGGTG GTCCTAAATA GTCAGCATAG TACATTTCAT 7620 

CTGACTAATA CTACAACACC ACCACCATGA ATAGAGGATT CTTTAACATG CTCGGCCGCC 7680 

GCCCCTTCCC GGCCCCCACT GCCATGTGGA GGCCGCGGAG AAGGAGGCAG GCGGCCCCGA 7740 

TGCCTGCCCG CAACGGGCTG GCTTCTCAAA TCCAGCAACT GACCACAGCC GTCAGTGCCC 7800 

TAGTCATTGG ACAGGCAACT AGACCTCAAC CCCCACGTCC ACGCCCGCCA CCGCGCCAGA 7860 

AGAAGCAGGC GCCCAAGCAA CCACCGAAGC CGAAGAAACC AAAAACGCAG GAGAAGAAGA 7920 

AGAAGCAACC TGCAAAACCC AAACCCGGAA AGAGACAGCG CATGGCACTT AAGTTGGAGG 7980 

CCGACAGATT GTTCGACGTC AAGAACGAGG ACGGAGATGT CATCGGGCAC GCACTGGCCA 8040 

TGGAAGGAAA GGTAATGAAA CCTCTGCACG TGAAAGGAAC CATCGACCAC CCTGTGCTAT 8100 

CAAAGCTCAA ATTTACCAAG TCGTCAGCAT ACGACATGGA GTTCGCACAG TTGCCAGTCA 8160 

ACATGAGAAG TGAGGCATTC ACCTACACCA GTGAACACCC CGAAGGATTC TATAACTGGC 8220 

ACCACGGAGC GGTGCAGTAT AGTGGAGGTA GATTTACCAT CCCTCGCGGA GTAGGAGGCA 8280 

GAGGAGACAG CGGTCGTCCG ATCATGGATA ACTCCGGTCG GGTTGTCGCG ATAGTCCTCG 8340 

GTGGCGCTGA TGAAGGAACA CGAACTGCCC TTTCGGTCGT CACCTGGAAT AGTAAAGGGA 8400 

AGACAATTAA GACGACCCCG GAAGGGACAG AAGAGTGGTC CGCAGCACCA CTGGTCACGG 8460 

CAATGTGTTT GCTCGGAAAT GTGAGCTTCC CATGCGACCG CCCGCCCACA TGCTATACCC 8520 

GCGAACCTTC CAGAGCCCTC GACATCCTTG AAGAGAACGT GAACCATGAG GCCTACGATA 8580 

CCCTGCTCAA TGCCATATTG CGGTGCGGAT CGTCTGGCAG AAGCAAAAGA AGCGTCATTG 8640 

ACGACTTTAC CCTGACCAGC CCCTACTTGG GCACATGCTC GTACTGCCAC CATACTGTAC 8700 

CGTGCTTCAG CCCTGTTAAG ATCGAGCAGG TCTGGGACGA AGCGGACGAT AACACCATAC 8760 

GCATACAGAC TTCCGCCCAG TTTGGATACG ACCAAAGCGG AGCAGCAAGC GCAAACAAGT 8820 

ACCGCTACAT GTCGCTTAAG CAGGATCACA CCGTTAAAGA AGGCACCATG GATGACATCA 8880 

AGATTAGCAC CTCAGGACCG TGTAGAAGGC TTAGCTACAA AGGATACTTT CTCCTCGCAA . 8940 

AATGCCCTCC AGGGGACAGC GTAACGGTTA GCATAGTGAG TAGCAACTCA GCAACGTCAT 9000 

GTACACTGGC CCGCAAGATA AAACCAAAAT TCGTGGGACG GGAAAAATAT GATCTACCTC 9060 

CCGTTCACGG TAAAAAAATT CCTTGCACAG TGTACGACCG TCTGAAAGAA ACAACTGCAG 9120 

GCTACATCAC TATGCACAGG CCGAGACCGC ACGCTTATAC ATCCTACCTG GAAGAATCAT 9180 

CAGGGAAAGT TTACGCAAAG CCGCCATCTG GGAAGAACAT TACGTATGAG TGCAAGTGCG 9240 

GCGACTACAA GACCGGAACC GTTTCGACCC GCACCGAAAT CACTGGTTGC ACCGCCATCA 9300 

AGCAGTGCGT CGCCTATAAG AGCGACCAAA CGAAGTGGGT CTTCAACTCA CCGGACTTGA 9360 

TCAGACATGA CGACCACACG GCCCAAGGGA AATTGCATTT GCCTTTCAAG TTGATCCCGA 9420 

GTACCTGCAT GGTCCCTGTT GCCCACGCGC CGAATGTAAT ACATGGCTTT AAACACATCA 9480 

GCCTCCAATT AGATACAGAC CACTTGACAT TGCTCACCAC CAGGAGACTA GGGGCAAACC 9540 

CGGAACCAAC CACTGAATGG ATCGTCGGAA AGACGGTCAG AAACTTCACC GTCGACCGAG 9600 

* ATGGCCTGGA ATACATATGG GGAAATCATG AGCCAGTGAG GGTCTATGCC CAAGAGTCAG 9660 

CACCAGGAGA CCCTCACGGA TGGCCACACG AAATAGTACA GCATTACTAC CATCGCCATC 9720 

CTGTGTACAC CATCTTAGCC GTCGCATCAG CTACCGTGGC GATGATGATT GGCGTAACTG 9780 

TTGCAGTGTT ATGTGCCTGT AAAGCGCGCC GTGAGTGCCT GACGCCATAC GCCCTGGCCC 9840 

CAAACGCCGT AATCCCAACT TCGCTGGCAC TCTTGTGCTG CGTTAGGTCG GCCAATGCTG 9900 

AAACGTTCAC CGAGACCATG AGTTACTTGT GGTCGAACAG TCAGCCGTTC TTCTGGGTCC 9960 

AGTTGTGCAT ACCTTTGGCC GCTTTCATCG TTCTAATGCG CTGCTGCTCC TGCTGCCTGC 1 0020 

CTTTTTTAGT GGTTGCCGGC GCCTACCTGG CGAAGGTAGA CGCCTACGAA CATGCGACCA 10080 
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CGCACACTCC CGCATTCGTT CCCGCCCGTA AGTACATAGA AGTGCCAGAA CAGCCTACCG 5100 

CTCCTCCTGC ACAGGCCGAG GAGGCCCCCG AAGTTGTAGC GACACCGTCA CCATCTACAG 5160 

CTGATAACAC CTCGCTTGAT GTCACAGACA TCTCACTGGA TATGGATGAC AGTAGCGAAG 5220 

GCTCACTTTT TTCGAGCTTT AGCGGATCGG ACAACTCTAT TACTAGTATG GACAGTTGGT 5280 

CGTCAGGACC TAGTTCACTA GAGATAGTAG ACCGAAGGCA GGTGGTGGTG GCTGACGTTC 5340 

ATGCCGTCCA AGAGCCTGCC CCTATTCCAC CGCCAAGGCT AAAGAAGATG GCCCGCCTGG 5400 

CAGCGGCAAG AAAAGAGCCC ACTCCACCGG CAAGCAATAG CTCTGAGTCC CTCCACCTCT 5460 

CTTTTGGTGG GGTATCCATG TCCCTCGGAT CAATTTTCGA CGGAGAGACG GCCCGCCAGG 5520 

CAGCGGTACA ACCCCTGGCA ACAGGCCCCA CGGATGTGCC TATGTCTTTC GGATCGTTTT 5580 

CCGACGGAGA GATTGATGAG CTGAGCCGCA GAGTAACTGA GTCCGAACCC GTCCTGTTTG 5640 

GATCATTTGA ACCGGGCGAA GTGAACTCAA TTATATCGTC CCGATCAGCC GTATCTTTTC 5700 

CACTACGCAA GCAGAGACGT AGACGCAGGA GCAGGAGGAC TGAATACTGA CTAACCGGGG 5760 

TAGGTGGGTA CATATTTTCG ACGGACACAG GCCCTGGGCA CTTGCAAAAG AAGTCCGTTC 5820 

TGCAGAACCA GCTTACAGAA CCGACCTTGG AGCGCAATGT CCTGGAAAGA ATTCATGCCC 5880 

CGGTGCTCGA CACGTCGAAA GAGGAACAAC TCAAACTCAG GTACCAGATG ATGCCCACCG 5940 

AAGCCAACAA AAGTAGGTAC CAGTCTCGTA AAGTAGAAAA TCAGAAAGCC ATAACCACTG 6000 

AGCGACTACT GTCAGGACTA CGACTGTATA ACTCTGCCAC AGATCAGCCA GAATGCTATA 6060 

AGATCACCTA TCCGAAACCA TTGTACTCCA GTAGCGTACC GGCGAACTAC TCCGATCCAC 6120 

AGTTCGCTGT AGCTGTCTGT AACAACTATC TGCATGAGAA CTATCCGACA GTAGCATCTT 6180 

ATCAGATTAC TGACGAGTAC GATGCTTACT TGGATATGGT AGACGGGACA GTCGCCTGCC 6240 

TGGATACTGC AACCTTCTGC CCCGCTAAGC TTAGAAGTTA CCCGAAAAAA CATGAGTATA 6300 

GAGCCCCGAA TATCCGCAGT GCGGTTCCAT CAGCGATGCA GAACACGCTA CAAAATGTGC 6360 

TCATTGCCGC AACTAAAAGA AATTGCAACG TCACGCAGAT GCGTGAACTG CCAACACTGG .6420 

ACTCAGCGAC ATTCAATGTC GAATGCTTTC GAAAATATGC ATGTAATGAC GAGTATTGGG 6480 

AGGAGTTCGC TCGGAAGCCA ATTAGGATTA CCACTGAGTT TGTCACCGCA TATGTAGCTA 6540 

GACTGAAAGG CCCTAAGGCC GCCGCACTAT TTGCAAAGAC GTATAATTTG GTCCCATTGC 6600 

AAGAAGTGCC TATGGATAGA TTCGTCATGG ACATGAAAAG AGACGTGAAA GTTACACCAG 6660 

GCACGAAACA CACAGAAGAA AGACCGAAAG TACAAGTGAT ACAAGCCGCA GAACCCCTGG 6720 

CGACTGCTTA CTTATGCGGG ATTCACCGGG AATTAGTGCG TAGGCTTACG GCCGTCTTGC 6780 

- TTCCAAACAT TCACACGCTT TTTGACATGT CGGCGGAGGA TTTTGATGCA ATCATAGCAG 6840 

AACACTTCAA GCAAGGCGAC CCGGTACTGG AGACGGATAT CGCATCATTC GACAAAAGCC 6900 

AAGACGACGC TATGGCGTTA ACCGGTCTGA TGATCTTGGA GGACCTGGGT GTGGATCAAC 6960 

CACTACTCGA CTTGATCGAG TGCGCCTTTG GAGAAATATC ATCCACCCAT CTACCTACGG 7020 

GTACTCGTTT TAAATTCGGG GCGATGATGA AATCCGGAAT GTTCCTCACA CTTTTTGTCA 7080 

. -ACACAGTTTT GAATGTCGTT ATCGCCAGCA GAGTACTAGA AGAGCGGCTT AAAACGTCCA 7140 

. GATGTGCAGC GTTCATTGGC GACGACAACA TCATACATGG AGTAGTATCT GACAAAGAAA 7200 

TGGCTGAGAG GTGCGCCACC TGGCTCAACA TGGAGGTTAA GATCATCGAC GCAGTCATCG 7260 

GTGAGAGACC ACCTTACTTC TGCGGCGGAT TTATCTTGCA AGATTCGGTT ACTTCCACAG 7320 

CGTGCCGCGT GGCGGATCCC CTGAAAAGGC TGTTTAAGTT GGGTAAACCG CTCCCAGCCG 7380 

ACGACGAGCA AGACGAAGAC AGAAGACGCG CTCTGCTAGA TGAAACAAAG GCGTGGTTTA 7440 

GAGTAGGTAT AACAGGCACT TTAGCAGTGG CCGTGACGAC CCGGTATGAG GTAGACAATA 7500 

TTACACCTGT CCTACTGGCA TTGAGAACTT TTGCCCAGAG CAAAAGAGCA TTCCAAGCCA 7560 
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CCATGCAATG CGGATTCTTC AACATGATGC AACTAAAGGT ACATTTCAAT CACCCTGAAA 2580 

AAGACATATG CACCAAGACA TTCTACAAGT ATATCTCCCG GCGTTGCACA CAGCCAGTTA 2640 

CAGCTATTGT ATCGACACTG CATTACGATG GAAAGATGAA AACCACGAAC CCGTGCAAGA 2700 

AGAACATTGA AATCGATATT ACAGGGGCCA CAAAGCCGAA GCCAGGGGAT ATCATCCTGA 2760 

CATGTTTCCG CGGGTGGGTT AAGCAATTGC AAATCGACTA TCCCGGACAT GAAGTAATGA 2820 

CAGCCGCGGC CTCACAAGGG CTAACCAGAA AAGGAGTGTA TGCCGTCCGG CAAAAAGTCA 2880 

ATGAAAACCC ACTGTACGCG ATCACATCAG AGCATGTGAA CGTGTTGCTC ACCCGCACTG 2940 

AGGACAGGCT AGTGTGGAAA ACCTTGCAGG GCGACCCATG GATTAAGCAG CCCACTAACA 3000 

TACCTAAAGG AAACTTTCAG GCTACTATAG AGGACTGGGA AGCTGAACAC AAGGGAATAA 3060 

TTGCTGCAAT AAACAGCCCC ACTCCCCGTG CCAATCCGTT CAGCTGCAAG ACCAACGTTT 3120 

GCTGGGCGAA AGCATTGGAA CCGATACTAG CCACGGCCGG TATCGTACTT ACCGGTTGCC 3180 

AGTGGAGCGA ACTGTTCCCA CAGTTTGCGG ATGACAAACC ACATTCGGCC ATTTACGCCT 3240 

TAGACGTAAT TTGCATTAAG TTTTTCGGCA TGGACTTGAC AAGCGGACTG TTTTCTAAAC 3300 

AGAGCATCCC ACTAACGTAC CATCCCGCCG ATTCAGCGAG GCCGGTAGCT CATTGGGACA 3360 

ACAGCCCAGG AACCCGCAAG TATGGGTACG ATCACGCCAT TGCCGCCGAA CTCTCCCGTA 3420 

GATTTCCGGT GTTCCAGCTA GCTGGGAAGG GCACACAACT TGATTTGCAG ACGGGGAGAA 3480 

CCAGAGTTAT CTCTGCACAG CATAACCTGG TCCCGGTGAA CCGCAATCTT CCTCACGCCT 3540 

TAGTCCCCGA GTACAAGGAG AAGCAACCCG GCCCGGTCAA AAAATTCTTG AACCAGTTCA 3600 

AACACCACTC AGTACTTGTG GTATCAGAGG AAAAAATTGA AGCTCCCCGT AAGAGAATCG 3660 

AATGGATCGC CCCGATTGGC ATAGCCGGTG CAGATAAGAA CTACAACCTG GCTTTCGGGT 3720 

TTCCGCCGCA GGCACGGTAC GACCTGGTGT TCATCAACAT TGGAACTAAA TACAGAAACC 3780 

ACCACTTTCA GCAGTGCGAA GACCATGCGG CGACCTTAAA AACCCTTTCG CGTTCGGCCC 3840 
TGAATTGCCT TAACCCAGGA GGCACCCTCG TGGTGAAGTC CTATGGCTAC GCCGACCGCA . 3900 

ACAGTGAGGA CGTAGTCACC GCTCTTGCCA GAAAGTTTGT CAGGGTGTCT GCAGCGAGAC 3960 

CAGATTGTGT CTCAAGCAAT ACAGAAATGT ACCTGATTTT CCGACAACTA GACAACAGCC 4020 

GTACACGGCA ATTCACCCCG CACCATCTGA ATTGCGTGAT TTCGTCCGTG TATGAGGGTA 4080 

CAAGAGATGG AGTTGGAGCC GCGCCGTCAT ACCGCACCAA AAGGGAGAAT ATTGCTGACT 4140 

GTCAAGAGGA AGCAGTTGTC AACGCAGCCA ATCCGCTGGG TAGACCAGGC GAAGGAGTCT 4200 

GCCGTGCCAT CTATAAACGT TGGCCGACCA GTTTTACCGA TTCAGCCACG GAGACAGGCA 4260 

CCGCAAGAAT GACTGTGTGC CTAGGAAAGA AAGTGATCCA CGCGGTCGGC CCTGATTTCC 4320 

GGAAGCACCC AGAAGCAGAA GCCTTGAAAT TGCTACAAAA CGCCTACCAT GCAGTGGCAG 4380 

ACTTAGTAAA TGAACATAAC ATCAAGTCTG TCGCCATTCC ACTGCTATCT ACAGGCATTT 4440 

ACGCAGCCGG AAAAGACCGC CTTGAAGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

GAACTGACGC GGACGTAACC ATCTATTGCC TGGATAAGAA GTGGAAGGAA AGAATCGACG 4560 

. CGGCACTCCA ACTTAAGGAG TCTGTAACAG AGCTGAAGGA TGAAGATATG GAGATCGACG 4620 

ATGAGTTAGT ATGGATCCAT CCAGACAGTT GCTTGAAGGG AAGAAAGGGA TTCAGTACTA 4680 

CAAAAGGAAA ATTGTATTCG TACTTCGAAG GCACCAAATT CCATCAAGCA GCAAAAGACA 4740 

TGGCGGAGAT AAAGGTCCTG TTCCCTAATG ACCAGGAAAG TAATGAACAA CTGTGTGCCT 4800 

ACATATTGGG TGAGACCATG GAAGCAATCC GCGAAAAGTG CCCGGTCGAC CATAACCCGT 4860 

CGTCTAGCCC GCCCAAAACG TTGCCGTGCC TTTGCATGTA TGCCATGACG CCAGAAAGGG 4920 

TCCACAGACT TAGAAGCAAT AACGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CTAAGCACAA AATTAAGAAT GTTCAGAAGG TTCAGTGCAC GAAAGTAGTC CTGTTTAATC 5040 
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ATTGACGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAATTGCACT ACCATCACAA 60 

TGGAGAAGCC AGTAGTAAAC GTAGACGTAG ACCCCCAGAG TCCGTTTGTC GTGCAACTGC 120 

AAAAAAGCTT CCCGCAATTT GAGGTAGTAG CACAGCAGGT CACTCCAAAT GACCATGCTA 180 

ATGCCAGAGC ATTTTCGCAT CTGGCCAGTA AACTAATCGA GCTGGAGGTT CCTACCACAG 240 

CGACGATCTT GGACATAGGC AGCGCACCGG CTCGTAGAAT GTTTTCCGAG CACCAGTATC 300 

ATTGTGTCTG CCCCATGCGT AGTCCAGAAG ACCCGGACCG CATGATGAAA TACGCCAGTA 360 

AACTGGCGGA AAAAGCGTGC AAGATTACAA ACAAGAACTT GCATGAGAAG ATTAAGGATC 420 

TCCGGACCGT ACTTGATACG CCGGATGCTG AAACACCATC GCTCTGCTTT CACAACGATG 480 

TTACCTGCAA CATGCGTGCC GAATATTCCG TCATGCAGGA CGTGTATATC AACGCTCCCG 540 

GAACTATCTA TCATCAGGCT ATGAAAGGCG TGCGGACCCT GTACTGGATT GGCTTCGACA 600 

CCACCCAGTT CATGTTCTCG GCTATGGCAG GTTCGTACCC TGCGTACAAC ACCAACTGGG 660 

CCGACGAGAA AGTCCTTGAA GCGCGTAACA TCGGACTTTG CAGCACAAAG CTGAGTGAAG 720 

GTAGGACAGG AAAATTGTCG ATAATGAGGA AGAAGGAGTT GAAGCCCGGG TCGCGGGTTT 780 

ATTTCTCCGT AGGATCGACA CTTTATCCAG AACACAGAGC CAGCTTGCAG AGCTGGCATC 840 

TTCCATCGGT GTTCCACTTG AATGGAAAGC AGTCGTACAC TTGCCGCTGT GATACAGTGG 900 

TGAGTTGCGA AGGCTACGTA GTGAAGAAAA TCACCATCAG TCCCGGGATC ACGGGAGAAA 960 

CCGTGGGATA CGCGGTTACA CACAATAGCG AGGGCTTCTT GCTATGCAAA GTTACTGACA 1020 

CAGTAAAAGG AGAACGGGTA TCGTTCCCTG TGTGCACGTA CATCCCGGCC ACCATATGCG 1080 

ATCAGATGAC TGGTATAATG GCCACGGATA TATCACCTGA CGATGCACAA AAACTTCTGG 1140 

TTGGGCTCAA CCAGCGAATT GTCATTAACG GTAGGACTAA CAGGAACACC AACACCATGC 1200 

AAAATTACCT TCTGCCGATC ATAGCACAAG GGTTCAGCAA ATGGGCTAAG GAGCGCAAGG 1260 

ATGATCTTGA TAACGAGAAA ATGCTGGGTA CTAGAGAACG CAAGCTTACG TATGGCTGCT 1320 
TGTGGGCGTT TCGCACTAAG AAAGTACAIT CGTTTTATCG CCCACCTGGA ACGCAGACCT • 1380 

GCGTAAAAGT CCCAGCCTCT TTTAGCGCTT TTCCCATGTC GTCCGTATGG ACGACCTCTT 1440 

TGCCCATGTC GCTGAGGCAG AAATTGAAAC TGGCATTGCA ACCAAAGAAG GAGGAAAAAC 1500 

TGCTGCAGGT CTCGGAGGAA TTAGTCATGG AGGCCAAGGC TGCTTTTGAG GATGCTCAGG 1560 

AGGAAGCCAG AGCGGAGAAG CTCCGAGAAG CACTTCCACC ATTAGTGGCA GACAAAGGCA 1620 

TCGAGGCAGC CGCAGAAGTT GTCTGCGAAG TGGAGGGGCT CCAGGCGGAC ATCGGAGCAG 1680 

CATTAGTTGA AACCCCGCGC GGTCACGTAA GGATAATACC TCAAGCAAAT GACCGTATGA 1740 

TCGGACAGTA TATCGTTGTC TCGCCAAACT CTGTGCTGAA GAATGCCAAA CTCGCACCAG 1800 

CGCACCCGCT AGCAGATCAG GTTAAGATCA TAACACACTC CGGAAGATCA GGAAGGTACG 1860 

CGGTCGAACC ATACGACGCT AAAGTACTGA TGCCAGCAGG AGGTGCCGTA CCATGGCCAG 1920 

AATTCCTAGC ACTGAGTGAG AGCGCCACGT TAGTGTACAA CGAAAGAGAG TTTGTGAACC 1980 

GCAAACTATA CCACATTGCC ATGCATGGCC CCGCCAAGAA TACAGAAGAG GAGCAGTACA 2040 

-AGGTTACAAA GGCAGAGCTT GCAGAAACAG AGTACGTGTT TGACGTGGAC AAGAAGCGTT 2100 

GCGTTAAGAA GGAAGAAGCC TCAGGTCTGG TCCTCTCGGG AGAACTGACC AACCCTCCCT 2160 

ATCATGAGCT AGCTCTGGAG GGACTGAAGA CCCGACCTGC GGTCCCGTAC AAGGTCGAAA 2220 

CAATAGGAGT GATAGGCACA CCGGGGTCGG GCAAGTCAGC TATTATCAAG TCAACTGTCA 2280 

CGGCACGAGA TCTTGTTACC AGCGGAAAGA AAGAAAATTG TCGCGAAATT GAGGCCGACG 2340 

TGCTAAGACT GAGGGGTATG CAGATTACGT CGAAGACAGT AGATTCGGTT ATGCTCAACG 2400 

GATGCCACAA AGCCGTAGAA GTGCTGTACG TTGACGAAGC GTTCGCGTGC CACGCAGGAG 2460 

CACTACTTGC CTTGATTGCT ATCGTCAGGC CCCGCAAGAA GGTAGTACTA TGCGGAGACC 2520 
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TCAGAGGGGA AATAAAGCAT CTCTACGGTG GTCCTAAATA GTCAGCATAG TACATTTCAT 7620 

CTGACTAATA CTACAACACC ACCACCATGA ATAGAGGATT CTTTAACATG CTCGGCCGCC 76B0 

GCCCCTTCCC GGCCCCCACT GCCATGTGGA GGCCGCGGAG AAGGAGGCAG GCGGCCCCGA 7740 

TGCCTGCCCG CAACGGGCTG GCTTCTCAAA TCCAGCAACT GACCACAGCC GTCAGTGCCC 7800 

TAGTCATTGG ACAGGCAACT AGACCTCAAC CCCCACGTCC ACGCCCGCCA CCGCGCCAGA 7860 

AGAAGCAGGC GCCCAAGCAA CCACCGAAGC CGAAGAAACC AAAAACGCAG GAGAAGAAGA 7920 

AGAAGCAACC TGCAAAACCC AAACCCGGAA AGAGACAGCG CATGGCACTT AAGTTGGAGG 7980 

CCGACAGATT GTTCGACGTC 8000 
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CGCACACTCC CGCATTCGTT CCCGCCCGTA AGTACATAGA AGTGCCAGAA CAGCCTACCG 5100 

CTCCTCCTGC ACAGGCCGAG GAGGCCCCCG AAGTTGTAGC GACACCGTCA CCATCTACAG 5160 

CT GAT A AC AC CTCGCTTGAT GTCACAGACA TCTCACTGGA TATGGATGAC AGTAGCGAAG 5220 

GCTCACTTTT TTCGAGCTTT AGCGGATCGG ACAACTCTAT TACTAGTATG GACAGTTGGT 5280 

CGTCAGGACC TAGTTCACTA GAGATAGTAG ACCGAAGGCA GGTGGTGGTG GCTGACGTTC 5340 

ATGCCGTCCA AGAGCCTGCC CCTATTCCAC CGCCAAGGCT AAAGAAGATG GCCCGCCTGG 5400 

CAGCGGCAAG AAAAGAGCCC ACTCCACCGG CAAGCAATAG CTCTGAGTCC CTCCACCTCT 5460 

CTTTTGGTGG GGTATCCATG TCCCTCGGAT CAATTTTCGA CGGAGAGACG GCCCGCCAGG 5520 

CAGCGGTACA ACCCCTGGCA ACAGGCCCCA CGGATGTGCC TATGTCTTTC GGATCGTTTT 5580 

CCGACGGAGA GATTGATGAG CTGAGCCGCA GAGTAACTGA GTCCGAACCC GTCCTGTTTG 5640 

GATCATTTGA ACCGGGCGAA GTGAACTCAA TTATATCGTC CCGATCAGCC GTATCTTTTC 5700 

CACTACGCAA GCAGAGACGT AGACGCAGGA GCAGGAGGAC TGAATACTGA CTAACCGGGG 5760 

TAGGTGGGTA CATATTTTCG ACGGACACAG GCCCTGGGCA CTTGCAAAAG AAGTCCGTTC 5820 

TGCAGAACCA GCTTACAGAA CCGACCTTGG AGCGCAATGT CCTGGAAAGA ATTCATGCCC 5880 

CGGTGCTCGA CACGTCGAAA GAGGAACAAC TCAAACTCAG GTACCAGATG ATGCCCACCG 5940 

AAGCCAACAA AAGTAGGTAC CAGTCTCGTA AAGTAGAAAA TCAGAAAGCC ATAACCACTG 6000 

AGCGACTACT GTCAGGACTA CGACTGTATA ACTCTGCCAC AGATCAGCCA GAATGCTATA 6060 

AGATCACCTA TCCGAAACCA TTGTACTCCA GTAGCGTACC GGCGAACTAC TCCGATCCAC 6120 

AGTTCGCTGT AGCTGTCTGT AACAACTATC TGCATGAGAA CTATCCGACA GTAGCATCTT 6180 

ATCAGATTAC TGACGAGTAC GATGCTTACT TGGATATGGT AGACGGGACA GTCGCCTGCC 6240 

TGGATACTGC AACCTTCTGC CCCGCTAAGC TTAGAAGTTA CCCGAAAAAA CATGAGTATA 6300 

GAGCCCCGAA TATCCGCAGT GCGGTTCCAT CAGCGATGCA GAACACGCTA CAAAATGTGC 6360 

TCATTGCCGC AACTAAAAGA AATTGCAACG TCACGCAGAT GCGTGAACTG CCAACACTGG *6420 

ACTCAGCGAC ATTCAATGTC GAATGCTTTC GAAAATATGC ATGTAATGAC GAGTATTGGG 6480 

AGGAGTTCGC TCGGAAGCCA ATTAGGATTA CCACTGAGTT TGTCACCGCA TATGTAGCTA 6540 

GACTGAAAGG CCCTAAGGCC GCCGCACTAT TTGCAAAGAC GTATAATTTG GTCCCATTGC 6600 

AAGAAGTGCC TATGGATAGA TTCGTCATGG ACATGAAAAG AGACGTGAAA GTTACACCAG 6660 

GCACGAAACA CACAGAAGAA AGACCGAAAG TACAAGTGAT ACAAGCCGCA GAACCCCTGG 6720 

CGACTGCTTA CTTATGCGGG ATTCACCGGG AATTAGTGCG TAGGCTTACG GCCGTCTTGC 6780 

TTCCAAACAT TCACACGCTT TTTGACATGT CGGCGGAGGA TTTTGATGCA ATCATAGCAG 6840 

AACACTTCAA GCAAGGCGAC CCGGTACTGG AGACGGATAT CGCATCATTC GACAAAAGCC 6900 

AAGACGACGC TATGGCGTTA ACCGGTCTGA TGATCTTGGA GGACCTGGGT GTGGATCAAC 6960 

CACTACTCGA CTTGATCGAG TGCGCCTTTG GAGAAATATC ATCCACCCAT CTACCTACGG 7020 

GTACTCGTTT TAAATTCGGG GCGATGATGA AATCCGGAAT GTTCCTCACA CTTTTTGTCA 7080 

ACACAGTTTT GAATGTCGTT ATCGCCAGCA GAGTACTAGA AGAGCGGCTT AAAACGTCCA 7140 

GATGTGCAGC GTTCATTGGC GACGACAACA TCATACATGG AGTAGTATCT GACAAAGAAA 7200 

TGGCTGAGAG GTGCGCCACC TGGCTCAACA TGGAGGTTAA GATCATCGAC GCAGTCATCG 7260 

GTGAGAGACC ACCTTACTTC TGCGGCGGAT TTATCTTGCA AGATTCGGTT ACTTCCACAG 7320 

CGTGCCGCGT GGCGGATCCC CTGAAAAGGC TGTTTAAGTT GGGTAAACCG CTCCCAGCCG 7380 

ACGACGAGCA AGACGAAGAC AGAAGACGCG CTCTGCTAGA TGAAACAAAG GCGTGGTTTA 7440 

GAGTAGGTAT AACAGGCACT TTAGCAGTGG CCGTGACGAC CCGGTATGAG GTAGACAATA 7500 

TTACACCTGT CCTACTGGCA TTGAGAACTT TTGCCCAGAG CAAAAGAGCA TTCCAAGCCA 7560 
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CCATGCAATG CGGATTCTTC AACATGATGC AACTAAAGGT ACATTTCAAT CACCCTGAAA 2580 

AAGACATATG CACCAAGACA TTCTACAAGT ATATCTCCCG GCGTTGCACA CAGCCAGTTA 2640 

CAGCTATTGT ATCGACACTG CATTACGATG GAAAGATGAA AACCACGAAC CCGTGCAAGA 2700 

AGAACATTGA AATCGATATT ACAGGGGCCA CAAAGCCGAA GCCAGGGGAT ATCATCCTGA 2760 

CATGTTTCCG CGGGTGGGTT AAGCAATTGC AAATCGACTA TCCCGGACAT GAAGTAATGA 2820 

CAGCCGCGGC CTCACAAGGG CTAACCAGAA AAGGAGTGTA TGCCGTCCGG CAAAAAGTCA 2880 

ATGAAAACCC ACTGTACGCG ATCACATCAG AGCATGTGAA CGTGTTGCTC ACCCGCACTG 2940 

AGGACAGGCT AGTGTGGAAA ACCTTGCAGG GCGACCCATG GATTAAGCAG CCCACTAACA 3000 

TACCTAAAGG AAACTTTCAG GCTACTATAG AGGACTGGGA AGCTGAACAC AAGGGAATAA 3060 

TTGCTGCAAT AAACAGCCCC ACTCCCCGTG CCAATCCGTT CAGCTGCAAG ACCAACGTTT 3120 

GCTGGGCGAA AGCATTGGAA CCGATACTAG CCACGGCCGG TATCGTACTT ACCGGTTGCC 3180 

AGTGGAGCGA ACTGTTCCCA CAGTTTGCGG ATGACAAACC ACATTCGGCC ATTTACGCCT 3240 

TAGACGTAAT TTGCATTAAG TTTTTCGGCA TGGACTTGAC AAGCGGACTG TTTTCTAAAC 3300 

AGAGCATCCC ACTAACGTAC CATCCCGCCG ATTCAGCGAG GCCGGTAGCT CATTGGGACA 3360 

ACAGCCCAGG AACCCGCAAG TATGGGTACG ATCACGCCAT TGCCGCCGAA CTCTCCCGTA 3420 

GATTTCCGGT GTTCCAGCTA GCTGGGAAGG GCACACAACT TGATTTGCAG ACGGGGAGAA 3480 

CCAGAGTTAT CTCTGCACAG CATAACCTGG TCCCGGTGAA CCGCAATCTT CCTCACGCCT 3540 

TAGTCCCCGA GTACAAGGAG AAGCAACCCG GCCCGGTCAA AAAATTCTTG AACCAGTTCA 3600 

AACACCACTC AGTACTTGTG GTATCAGAGG AAAAAATTGA AGCTCCCCGT AAGAGAATCG 3660 

AATGGATCGC CCCGATTGGC ATAGCCGGTG CAGATAAGAA CTACAACCTG GCTTTCGGGT 3720 

TTCCGCCGCA GGCACGGTAC GACCTGGTGT TCATCAACAT TGGAACTAAA TACAGAAACC 3780 

ACCACTTTCA GCAGTGCGAA GACCATGCGG CGACCTTAAA AACCCTTTCG CGTTCGGCCC 3840 
TGAATTGCCT TAACCCAGGA GGCACCCTGG TGGTGAAGTC CTATGGCTAC GCCGACCGCA • 3900 

ACAGTGAGGA CGTAGTCACC GCTCTTGCCA GAAAGTTTGT CAGGGTGTCT GCAGCGAGAC 3960 

CAGATTGTGT CTCAAGCAAT ACAGAAATGT ACCTGATTTT CCGACAACTA GACAACAGCC 4020 

GTACACGGCA ATTCACCCCG CACCATCTGA ATTGCGTGAT TTCGTCCGTG TATGAGGGTA 4080 

CAAGAGATGG AGTTGGAGCC GCGCCGTCAT ACCGCACCAA AAGGGAGAAT ATTGCTGACT 4140 

GTCAAGAGGA AGCAGTTGTC AACGCAGCCA ATCCGCTGGG TAGACCAGGC GAAGGAGTCT 4200 

GCCGTGCCAT CTATAAACGT TGGCCGACCA GTTTTACCGA TTCAGCCACG GAGACAGGCA 4260 

CCGCAAGAAT GACTGTGTGC CTAGGAAAGA AAGTGATCCA CGCGGTCGGC CCTGATTTCC 4320 

GGAAGCACCC AGAAGCAGAA GCCTTGAAAT TGCTACAAAA CGCCTACCAT GCAGTGGCAG 4380 

ACTTAGTAAA TGAACATAAC ATCAAGTCTG TCGCCATTCC ACTGCTATCT ACAGGCATTT 4440 

ACGCAGCCGG AAAAGACCGC CTTGAAGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

GAACTGACGC GGACGTAACC ATCTATTGCC TGGATAAGAA GTGGAAGGAA AGAATCGACG 4560 

* CGGCACTCCA ACTTAAGGAG TCTGTAACAG AGCTGAAGGA TGAAGATATG GAGATCGACG 4620 

ATGAGTTAGT ATGGATCCAT CCAGACAGTT GCTTGAAGGG AAGAAAGGGA TTCAGTACTA 4680 

CAAAAGGAAA ATTGTATTCG TACTTCGAAG GCACCAAATT CCATCAAGCA GCAAAAGACA 4740 

TGGCGGAGAT AAAGGTCCTG TTCCCTAATG ACCAGGAAAG TAATGAACAA CTGTGTGCCT 4800 

ACATATTGGG TGAGACCATG GAAGCAATCC GCGAAAAGTG CCCGGTCGAC CATAACCCGT 4860 

CGTCTAGCCC GCCCAAAACG TTGCCGTGCC TTTGCATGTA TGCCATGACG CCAGAAAGGG 4920 

TCCACAGACT TAGAAGCAAT AACGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CTAAGCACAA AATTAAGAAT GTTCAGAAGG TTCAGTGCAC GAAAGTAGTC CTGTTTAATC 5040 
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ATTGACGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAATTGCACT ACCATCACAA 60 

TGGAGAAGCC AGTAGTAAAC GTAGACGTAG ACCCCCAGAG TCCGTTTGTC GTGCAACTGC 120 

AAAAAAGCTT CCCGCAATTT GAGGTAGTAG CACAGCAGGT CACTCCAAAT GACCATGCTA 180 

ATGCCAGAGC ATTTTCGCAT CTGGCCAGTA AACTAATCGA GCTGGAGGTT CCTACCACAG 240 

CGACGATCTT GGACATAGGC AGCGCACCGG CTCGTAGAAT GTTTTCCGAG CACCAGTATC 300 

ATTGTGTCTG CCCCATGCGT AGTCCAGAAG ACCCGGACCG CATGATGAAA TACGCCAGTA 360 

AACTGGCGGA AAAAGCGTGC AAGATTACAA ACAAGAACTT GCATGAGAAG ATTAAGGATC 420 

TCCGGACCGT ACTTGATACG CCGGATGCTG AAACACCATC GCTCTGCTTT CACAACGATG 480 

TTACCTGCAA CATGCGTGCC GAATATTCCG TCATGCAGGA CGTGTATATC AACGCTCCCG 540 

GAACTATCTA TCATCAGGCT ATGAAAGGCG TGCGGACCCT GTACTGGATT GGCTTCGACA 600 

CCACCCAGTT CATGTTCTCG GCTATGGCAG GTTCGTACCC TGCGTACAAC ACCAACTGGG 660 

CCGACGAGAA AGTCCTTGAA GCGCGTAACA TCGGACTTTG CAGCACAAAG CTGAGTGAAG 720 

GTAGGACAGG AAAATTGTCG ATAATGAGGA AGAAGGAGTT GAAGCCCGGG TCGCGGGTTT 780 

ATTTCTCCGT AGGATCGACA CTTTATCCAG AACACAGAGC CAGCTTGCAG AGCTGGCATC 840 

TTCCATCGGT GTTCCACTTG AATGGAAAGC AGTCGTACAC TTGCCGCTGT GATACAGTGG 900 

TGAGTTGCGA AGGCTACGTA GTGAAGAAAA TCACCATCAG TCCCGGGATC ACGGGAGAAA 960 

CCGTGGGATA CGCGGTTACA CACAATAGCG AGGGCTTCTT GCTATGCAAA GTTACTGACA 1020 

CAGTAAAAGG AGAACGGGTA TCGTTCCCTG TGTGCACGTA CATCCCGGCC ACCATATGCG 1080 

ATCAGATGAC TGGTATAATG GCCACGGATA TATCACCTGA CGATGCACAA AAACTTCTGG 1140 

TTGGGCTCAA CCAGCGAATT GTCATTAACG GTAGGACTAA CAGGAACACC AACACCATGC 1200 

AAAATTACCT TCTGCCGATC ATAGCACAAG GGTTCAGCAA ATGGGCTAAG GAGCGCAAGG 1260 

ATGATCTTGA TAACGAGAAA ATGCTGGGTA CTAGAGAACG CAAGCTTACG TATGGCTGCT 1320 
TGTGGGCGTT TCGCACTAAG AAAGTACATT CGTTTTATCG CCCACCTGGA ACGCAGACCT * 1380 

GCGTAAAAGT CCCAGCCTCT TTTAGCGCTT TTCCCATGTC GTCCGTATGG ACGACCTCTT 1440 

TGCCCATGTC GCTGAGGCAG AAATTGAAAC TGGCATTGCA ACCAAAGAAG GAGGAAAAAC 1500 

TGCTGCAGGT CTCGGAGGAA TTAGTCATGG AGGCCAAGGC TGCTTTTGAG GATGCTCAGG 1560 

AGGAAGCCAG AGCGGAGAAG CTCCGAGAAG CACTTCCACC ATTAGTGGCA GACAAAGGCA 1620 

TCGAGGCAGC CGCAGAAGTT GTCTGCGAAG TGGAGGGGCT CCAGGCGGAC ATCGGAGCAG 1680 

CATTAGTTGA AACCCCGCGC GGTCACGTAA GGATAATACC TCAAGCAAAT GACCGTATGA 1740 

TCGGACAGTA TATCGTTGTC TCGCCAAACT CTGTGCTGAA GAATGCCAAA CTCGCACCAG 1800 

CGCACCCGCT AGCAGATCAG GT7AAGATCA TAACACACTC CGGAAGATCA GGAAGGTACG 1860 

CGGTCGAACC ATACGACGCT AAAGTACTGA TGCCAGCAGG AGGTGCCGTA CCATGGCCAG 1920 

AATTCCTAGC ACTGAGTGAG AGCGCCACGT TAGTGTACAA CGAAAGAGAG TTTGTGAACC 1980 

GCAAACTATA CCACATTGCC ATGCATGGCC CCGCCAAGAA TACAGAAGAG GAGCAGTACA 2040 

AGGTTACAAA GGCAGAGCTT GCAGAAACAG AGTACGTGTT TGACGTGGAC AAGAAGCGTT 2100 

GCGTTAAGAA GGAAGAAGCC TCAGGTCTGG TCCTCTCGGG AGAACTGACC AACCCTCCCT 2160 

ATCATGAGCT AGCTCTGGAG GGACTGAAGA CCCGACCTGC GGTCCCGTAC AAGGTCGAAA 2220 

CAATAGGAGT GATAGGCACA CCGGGGTCGG GCAAGTCAGC TATTATCAAG TCAACTGTCA 2280 

CGGCACGAGA TCTTGTTACC AGCGGAAAGA AAGAAAATTG TCGCGAAATT GAGGCCGACG 2340 

TGCTAAGACT GAGGGGTATG CAGATTACGT CGAAGACAGT AGATTCGGTT ATGCTCAACG 2400 

GATGCCACAA AGCCGTAGAA GTGCTGTACG TTGACGAAGC GTTCGCGTGC CACGCAGGAG 2460 

CACTACTTGC CTTGATTGCT ATCGTCAGGC CCCGCAAGAA GGTAGTACTA TGCGGAGACC 2520 
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TCAGAGGGGA AATAAAGCAT CTCTACGGTG GTCCTAAATA GTCAGCATAG TTCATTTCAT 7620 

CTGACTAATA CTACAACACC ACCACCATGA ATAGAGGATT CTTTAACATG CTCGGCCGCC 7680 

GCCCCTTCCC GGCCCCCACT GCCATGTGGA GGCCGCGGAG AAGGAGGCAG GCGGCCCCGA 7740 

TGCCTGCCCG CAACGGGCTG GCTTCTCAAA TCCAGCAACT GACCACAGCC GTCAGTGCCC 7800 

TAGTCATTGG ACAGGCAACT AGACCTCAAC CCCCATGTCC ACGCCCGCCA CCGCGCCAGA 7860 

AGAAGCAGGC GCCCAAGCAA CCACCGAAGC CGAAGAAACC AAAAACGCAG GAGAAGAAGA 7920 

AGAAGCAACC TGCAAAACCC AAACCCGGAA AGAGACAGCG CATGGCACTT AAGTTGGAGG 7980 

CCGACAGATT GTTCGACGTC 8000 
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CGCACACTCC CGCATTCGTT CCCGCCCGTA AGTACATAGA AGTGCCAGAA CAGCCTACCG 5100 

CTCCTCCTGC ACAGGCCGAG GAGGCCCCCG AAGTTGTAGC GACACCGTCA CCATCTACAG 5160 

CTGATAACAC CTCGCTTGAT GTCACAGACA TCTCACTGGA TATGGATGAC AGTAGCGAAG 5320 

GCTCACTTTT TTCGAGCTTT AGCGGATCGG ACAACTCTAT TACTAGTATG GACAGTTGGT 5280 

CGTCAGGACC TAGTTCACTA GAGATAGTAG ACCGAAGGCA GGTGGTGGTG GCTGACGTTC 5340 

ATGCCGTCCA AGAGCCTGCC CCTATTCCAC CGCCAAGGCT AAAGAAGATG GCCCGCCTGG 5400 

CAGCGGCAAG AAAAGAGCCC ACTCCACCGG CAAGCAATAG CTCTGAGTCC CTCCACCTCT 5460 

CTTTTGGTGG GGTATCCATG TCCCTCGGAT CAATTTTCGA CGGAGAGACG GCCCGCCAGG 5520 

CAGCGGTACA ACCCCTGGCA ACAGGCCCCA CGGATGTGCC TATGTCTTTC GGATCGTTTT 5580 

CCGACGGAGA GATTGATGAG CTGAGCCGCA GAGTAACTGA GTCCGAACCC GTCCTGTHG 5640 

GATCATTTGA ACCGGGCGAA GTGAACTCAA TTATATCGTC CCGATCAGCC GTATCTT . i'C 5700 

CTCTACGCAA GCAGAGACGT AGACGCAGGA GCAGGAGGAC TGAATACTGA CTAACCGGGG 5760 

TAGGTGGGTA CATATTTTCG ACGGACACAG GCCCTGGGCA CTTGCAAAAG AAGTCCGTTC 5820 

TGCAGAACCA GCTTACAGAA CCGACCTTGG AGCACAATGT CCTGGAAAGA ATTCATGCCC 5880 

CGGTGCTCGA CACGTCGAAA GAGGAACAAC TCAAACTCAG GTACCAGATG ATGCCCACCG 5940 

AAGCCAACAA AAGTAGGTAC CAGTCTCGTA AAGTAGAAAA TCAGAAAGCC ATAACCACTG 6000 

AGCGACTACT GTCAGGACTA CGACTGTATA ACTCTGCCAC AGATCAGCCA GAATGCTATA 6060 

AGATCACCTA TCCGAAACCA TTGTACTCCA GTAGCGTACC GGCGAACTAC TCCGATCCAC 6120 

AGTTCGCTGT AGCTGTCTGT AACAACTATC TGCATGAGAA CTATCCGACA GTAGCATCTT 6180 

ATCAGATTAC TGACGAGTAC GATGCTTACT TGGATATGGT AGACGGGACA GTCGCCTGCC 6240 

TGGATACTGC AACCTTCTGC CCCGCTAAGC TTAGAAGTTA CCCGAAAAAA CATGAGTATA 6300 

GAGCCCCGAA TATCCGCAGT GCGGTTCCAT CAGCGATGCA GAACACGCTA CAAAATGTGC 6360 
TCATTGCCGC AACTAAAAGA AATTGCAACG TCACGCAGAT GCGTGAACTG CCAACACTGG . 6420 

ACTCAGCGAC ATTCAATGTC GAATGCTTTC GAAAATATGC ATGTAATGAC GAGTATTGGG 6480 

AGGAGTTCGC TCGGAAGCCA ATTAGGATTA CCACTGAGTT TGTCACCGCA TATGTAGCTA 6540 

GACTGAAAGG CCCTAAGGCC GCCGCACTAT TTGCAAAGAC GTATAATTTG GTCCCATTGC 6600 

AAGAAGTGCC TATGGATAGA TTCGTCATGG ACATGAAAAG AGACGTGAAA GTTACACCAG 6660 

GCACGAAACA CACAGAAGAA AGACCGAAAG TACAAGTGAT ACAAGCCGCA GAACCCCTGG 6720 

CGACTGCTTA CTTATGCGGG ATTCACCGGG AATTAGTGCG TAGGCTTACG GCCGTCTTGC 6780 

TTCCAAACAT TCACACGCTT TTTGACATGT CGGCGGAGGA TTTTGATGCA ATCATAGCAG 6840 

AACACTTCAA GCAAGGCGAC CCGGTACTGG AGACGGATAT CGCATCATTC GACAAAAGCC 6900 

AAGACGACGC TATGGCGTTA ACCGGTCTGA TGATCTTGGA GGACCTGGGT GTGGATCAAC 6960 

CACTACTCGA CTTGATCGAG TGCGCCTTTG GAGAAATATC ATCCACCCAT CTACCTACGG 7020 

GTACTCGTTT TAAATTCGGG GCGATGATGA AATCCGGAAT GTTCCTCACA CTTTTTGTCA 7080 

ACACAGTTTT GAATGTCGTT ATCGCCAGCA GAGTACTAGA AGAGCGGCTT AAAACGTCCA 7140 

GATGTGCAGC GTTCATTGGC GACGACAACA TCATACATGG AGTAGTATCT GACAAAGAAA 7200 

TGGCTGAGAG GTGCGCCACC TGGCTCAACA TGGAGGTTAA GATCATCGAC GCAGTCATCG 7260 

GTGAGAGACC ACCTTACTTC TGCGGCGGAT TTATCTTGCA AGATTCGGTT ACTTCCACAG 7320 

CGTGCCGCGT GGCGGATCCC CTGAAAAGGC TGTTTAAGTT GGGTAAACCG CTCCCAGCCG 7380 

ACGACGAGCA AGACGAAGAC AGAAGACGCG CTCTGCTAGA TGAAACAAAG GCGTGGTTTA 7440 

GAGTAGGTAT AACAGGCACT TTAGCAGTGG CCGTGACGAC CCGGTATGAG GTAGACAATA 7500 

TTACACCTGT CCTACTGGCA TTGAGAACTT TTGCCCAGAG CAAAAGAGCA TTCCAAGCCA 7560 
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CCATGCAATG CGGATTCTTC AACATGATGC AACTAAAGGT ACATTTCAAT CACCCTGAAA 2580 

AAGACATATG CACCAAGACA TTCTACAAGT ATATCTCCCG GCGTTGCACA CAGCCAGTTA 2640 

CAGCTATTGT ATCGACACTG CATTACGATG GAAAGATGAA AACCACGAAC CCGTGCAAGA 2700 

AGAACATTGA AATCGATATT ACAGGGGCCA CAAAGCCGAA GCCAGGGGAT ATCATCCTGA 2760 

CATGTTTCCG CGGGTGGGTT AAGCAATTGC AAATCGACTA TCCCGGACAT GAAGTAATGA 2820 

CAGCCGCGGC CTCACAAGGG CTAACCAGAA AAGGAGTGTA TGCCGTCCGG CAAAAAGTCA 2880 

ATGAAAACCC ACTGTACGCG ATCACATCAG AGCATGTGAA CGTGTTGCTC ACCCGCACTG 2940 

AGGACAGGCT AGTGTGGAAA ACCTTGCAGG GCGACCCATG GATTAAGCAG CTCACTAACA 3000 

TACCTAAAGG AAACTTTCAG GCTACTATAG AGGACTGGGA AGCTGAACAC AAGGGAATAA 3060 

TTGCTGCAAT AAACAGCCCC ACTCCCCGTG CCAATCCGTT CAGCTGCAAG ACCAACGTTT 3120 

GCTGGGCGAA AGCATTGGAA CCGATACTAG CCACGGCCGG TATCGTACTT ACCGGTTGCC 3180 

AGTGGAGCGA ACTGTTCCCA CAGTTTGCGG ATGACAAACC ACATTCGGCC ATTTACGCCT 3240 

TAGACGTAAT TTGCATTAAG TTTTTCGGCA TGGACTTGAC AAGCGGACTG TTTTCTAAAC 3300 

AGAGCATCCC ACTAACGTAC CATCCCGCCG ATTCAGCGAG GCCGGTAGCT CATTGGGACA 3360 

ACAGCCCAGG AACCCGCAAG TATGGGTACG ATCACGCCAT TGCCGCCGAA CTCTCCCGTA 3420 

GATTTCCGGT GTTCCAGCTA GCTGGGAAGG GCACACAACT TGATTTGCAG ACGGGGAGAA 3480 

CCAGAGTTAT CTCTGCACAG CATAACCTGG TCCCGGTGAA CCGCAATCTT CCTCACGCCT 3540 

TAGTCCCCGA GTACAAGGAG AAGCAACCCG GCCCGGTCGA AAAATTCTTG AACCAGTTCA 3600 

AACACCACTC AGTACTTGTG GTATCAGAGG AAAAAATTGA AGCTCCCCGT AAGAGAATCG 3660 

AATGGATCGC CCCGATTGGC ATAGCCGGTG CAGATAAGAA CTACAACCTG GCTTTCGGGT 3720 

TTCCGCCGCA GGCACGGTAC GACCTGGTGT TCATCAACAT TGGAACTAAA TACAGAAACC 3780 

ACCACTTTCA GCAGTGCGAA GACCATGCGG CGACCTTAAA AACCCTTTCG CGTTCGGCCC 3840 

TGAATTGCCT TAACTCAGGA GGCACTCTCG TGGTGAAGTC CTATGGCTAC GCCGACCGCA * 3900 

ACAGTGAGGA CGTAGTCACC GCTCTTGCCA GAAAGTTTGT CAGGGTGTCT GCAGCGAGAC 3960 

CAGATTGTGT CTCAAGCAAT ACAGAAATGT ACCTGATTTT CCGACAACTA GACAACAGCC 4020 

GTACACGGCA ATTCACCCCG CACCATCTGA ATTGCGTGAT TTCGTCCGTG TATGAGGGTA 4080 

CAAGAGATGG AGTTGGAGCC GCGCCGTCAT ACCGCACCAA AAGGGAGAAT ATTGCTGACT 4140 

GTCAAGAGGA AGCAGTTGTC AACGCAGCCA ATCCGCTGGG TAGACCAGGC GAAGGAGTCT 4200 

GCCGTGCCAT CTATAAACGT TGGCCGACCA GTTTTACCGA TTCAGCCACG GAGACAGGCA 4260 

CCGCAAGAAT GACTGTGTGC CTAGGAAAGA AAGTGATCCA CGCGGTCGGC CCTGATTTCC 4320 

GGAAGCACCC AGAAGCAGTA GCCTTGAAAT TGCTACAAAA CGCCTACCAT GCAGTGGCAG 4380 

ACTTAGTAAA TGAACATAAC ATCAAGTCTG TCGCCATTCC ACTGCTATCT ACAGGCATTT 4440 

ACGCAGCCGG AAAAGACCGC CTTGAAGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

GAACTGACGC GGACGTAACC ATCTATTGCC TGGATAAGAA GTGGAAGGAA AGAATCGACG 4560 

CGGCACTCCA ACTTAAGGAG TCTGTAACAG AGCTGAAGGA TGAAGATATG GAGATCGACG 4620 

ATGAGTTAGT ATGGATCCAT CCAGACAGTT GCTTGAAGGG AAGAAAGGGA TTCAGTACTA 4680 

CAAAAGGAAA ATTGTATTCG TACTTCGAAG GCACCAAATT CCATCAAGCA GCAAAAGACA 4740 

TGGCGGAGAT AAAGGTCCTG TTCCCTAATG ACCAGGAAAG TAATGAACAA CTGTGTGCCT 4800 

ACATATTGGG TGAGACCATG GAAGCAATCC GCGAAAAGTG CCCGGTCGAC CATAACCCGT 4860 

CGTTTAGCCC GCCCAAAACG TTGCCGTGCC TTTGCATGTA TGCCATGACG CCAGAAAGGG 4920 

TCCACAGACT TAGAAGCAAT AACGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CTAAGCACAA AATTAAGAAT GTTCAGAAGG TTCAGTGCAC GAAAGTAGTC CTGTTTAATC 5040 
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ATTGACGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAATCGCACT ACCATCACAA 60 

TGGAGAAGCC AGTAGTAAAC GTAGACGTAG ACCCCCAGAG TCCGTTTGTC GTGCAACTGA 120 

AAAAAAGCTT CCCGCAATTT GAGGTAGTAG CACAGCAGGT CACTCCAAAT GACCATGCTA 180 

ATGCCAGAGC ATTTTCGCAT CTGGCCAGTA AACTAATCGA GCTGGAGGTT CCTACCACAG 240 

CGACGATCTT GGACATAGGC AGCGCACCGG CTCGTAGAAT GTTTTCCGAG CACCAGTATC 300 

ATTGTGTCTG CCCCATGCGT AGTCCAGAAG ACCCGGACCG CATGATGAAA TACGCCAGTA 360 

AACTGGCGGA AAAAGCGTGC AAGATTACAA ACAAGAACTT GCATGAGAAG ATTAAGGATC 420 

TCCGGACCGT ACTTGATACG CCGGATGCTG AAACACCATC GCTCTGCTTT CACAACGATG 480 

TTACCTGCAA CATGCGTGCC GAATATTCCG TCATGCAGGA CGTGTATATC AACGCTCCCG 540 

GAACTATCTA TCATCAGGCT ATGAAAGGCG TGCGGACCCT GTACTGGATT GGCTTCGACA 600 

CCACCCAGTT CATGTTCTCG GCTATGGCAG GTTCGTACCC TGCGTACAAC ACCAACTGGG 660 

CCGACGAGAA AGTCCTTGAA GCGCGTAACA TCGGACTTTG CAGCACAAAG CTGAGTGAAG 720 

GTAGGACAGG AAAATTGTCG ATAATGAGGA AGAAGGAGTT GAAGCCCGGG TCGCGGGTTT 780 

ATTTCTCCGT AGGATCGACA CTTTATCCAG AACACAGAGC CAGCTTGCAG AGCTGGCATC 840 

TTCCATCGGT GTTCCACTTG AATGGAAAGC AGTCGTACAC TTGCCGCTGT GATACAGTGG 900 

TGAGTTGCGA AGGCTACGTA GTGAAGAAAA TCACCATCAG TCCCGGGATC ACGGGAGAAA 960 

CCGTGGGATA CGCGGTTACA CACAATAGCG AGGGCTTCTT GCTATGCAAA GTTACTGACA 1020 

CAGTAAAAGG AGAACGGGTA TCGTTCCCTG TGTGCACGTA CATCCCGGCC ACCATATGCG 1080 

ATCAGATGAC TGGTATAATG GCCACGGATA TATCACCTGA CGATGCACAA AAACTTCTGG 1140 

TTGGGCTCAA CCAGCGAATT GTCATTAACG GTAGGACTAA CAGGAACACC AACACCATGC 1200 

AAAATTACCT TCTGCCGATC ATAGCACAAG GGTTCAGCAA ATGGGCTAAG GAGCGCAAGG 1260 

ATGATCTTGA TAACGAGAAA ATGCTGGGTA CTAGAGAACG CAAGCTTACG TATGGCTGCT 1320 
TGTGGGCGTT TCGCACTAAG AAAGTACAIT CGTTTTATCG CCCACCTGGA ACGCAGACC7 • 1380 

GCGTAAAAGT CCCAGCCTCT TTTAGCGCTT TTCCCATGTC GTCCGTATGG ACGACCTCTT 1440 

TGCCCATGTC GCTGAGGCAG AAATTGAAAC TGGCATTGCA ACCAAAGAAG GAGGAAAAAC 1500 

TGCTGCAGGT CTCGGAGGAA TTAGTCATGG AGGCCAAGGC TGCTTTTGAG GATGCTCAGG 1560 

AGGAAGCCAG AGCGGAGAAG CTCCGAGAAG CACTTCCACC ATTAGTGGCA GACAAAGGCA 1620 

TCGAGGCAGC CGCAGAAGTT GTCTGCGAAG TGGAGGGGCT CCAGGCGGAC ATCGGAGCAG 1680 

CATTAGTTGA AACCCCGCGC GGTCACGTAA GGATAATACC TCAAGCAAAT GACCGTATGA 1740 

TCGGACAGTA TATCGTTGTC TCGCCAAACT CTGTACTGAA GAATGCCAAA CTCGCACCAG 1800 

CGCACCCGCT AGCAGATCAG GTTAAGATCA TAACACACTC CGGAAGATCA GGAAGGTACG 1860 

CGGTCGAACC ATACGACGCT AAAGTACTGA TGCCAGCAGG AGGTGCCGTA CCATGGCCAG 1920 

AATTCCTAGC ACTGAGTGAG AGCGCCACGT TAGTGTACAA CGAAAGAGAG CTTGTGAACC 1980 

GCAAACTATA CCACATTGCC ATGCATGGCC CCGCCAAGAA TACAGAAGAG GAGCAGTACA 2040 

AGGTTACAAA GGCAGAGCTT GCAGAAACAG AGTACGTGTT TGACGTGGAC AAGAAGCGTT 2100 

GCGTTAAGAA GGAAGAAGCC TCAGGTCTGG TCCTCTCGGG AGAACTGACC AACCCTCCCT 2160 

ATCATGAGCT AGCTCTGGAG GGACTGAAGA CCCGACCTGC GGTCCCGTAC AAGGTCGAAA 2220 

CAATAGGAGT GATAGGCACA CCGGGGTCGG GCAAGTCAGC TATTATCAAG TCAACTGTCA 2280 

CGGCACGAGA TCTTGTTACC AGCGGAAAGA AAGAAAATTG TCGCGAAATT GAGGCCGACG 2340 

TGCTAAGACT GAGGGGTATG CAGATTACGT CGAAGACAGT AGATTCGGTT ATGCTCAACG 2400 

GATGCCACAA AGCCGTAGAA GTGCTGTACG TTGACGAAGC GTTCGCGTGC CACGCAGGAG 2460 

CACTACTTGC CTTGATTGCT ATCGTCAGGC CCCGCAAGAA GGTAGTACTA TGCGGAGACC 2520 
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recombinant alphavirus particle according to claim 6, or a eukaryotic layered vector initiation 
system according to claim 18. 

22. A method of making recombinant alphavirus particles, comprising: 

(a) introducing a vector selected from the group consisting of a eukaryotic 
layered vector initiation system according to claim 18, an RNA vector replicon according to 
claim 4, and a recombinant alphavirus vector particle according to claim 6, into a population 
of packaging cells according to claim 16, under conditions and for a time sufficient to permit 
production of recombinant alphavirus particles; and 

(b) harvesting recombinant alphavirus particles. 

23. A method of making a selected protein, comprising: 

(a) introducing a vector which encodes a selected heterologous protein, 
and which is selected from the group consisting of a eukaryotic layered vector initiation 
system according to claim 18, an alphavirus RNA vector replicon according to claim 4, and a 
recombinant alphavirus vector particle according to claim 6, into a population of packaging 
cells according to claim 16, under conditions and for a time sufficient to permit production or 
said selected protein; and 

(b) harvesting protein produced by the packaging cells. 

24. A method of making a selected protein, comprising introducing a 
eukaryotic layered vector initiation system according to claim 18 into a host cell, under 
conditions and for sufficient to permit expression of said selected protein. 

25. A host cell line which contains an alphavirus RNA vector replicon 
according to claim 4. 
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encodes one or more functional alphavirus structural proteins, a selectable marker operably 
linked to transcription of the expression cassette, and a 3' sequence which controls 
transcription termination. 



16. An alphavirus packaging cell line, comprising a cell containing an 
alphavirus structural protein expression cassette according to claim 15. 

17. An alphavirus producer cell linfe, comprising a cell containing a stably 
transformed alphavirus structural protein expression cassette, and a vector selected from the 
group consisting of an RNA vector replicon according to claim 4, an alphavirus vector 
construct according to claim 3, and a eukaryotic layered vector initiation system according to 
claim 1 8. 



18. A eukaryotic layered vector initiation system, comprising a 5' promoter 
capable of initiating in vivo the 5' synthesis of alphavirus RNA from cDNA, a sequence 
which initiates transcription of alphavirus RNA following the 5* promoter, a nucleic acid 
molecule which operably encodes all four alphaviral nonstructural proteins, including a 
nucleic acid molecule according to claims 1 or 2, an alphavirus RNA polymerase recognition 
sequence, and a 3* polyadenylate tract. 

19. A host cell containing a eukaryotic layered vector initiation system 
according to claim 1 8. 



20 - A pharmaceutical composition, comprising a eukaryotic layered vector 
initiation system according to claim 18 and a pharmaceutically acceptable carrier or diluent. 

21. A method for delivering a selected heterologous sequence to a 
vertebrate or insect, comprising administering to a vertebrate or insect an alphavirus vector 
construct according to claim 3, an alphavirus RNA vector replicon according to claim 4, a 
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6. A recombinant alphavirus particle, comprising one or more alphavirus 
structural proteins, a lipid envelope, and an RNA vector replicon according to claim 4. 

7. The recombinant alphavirus particle according to claim 6 wherein said 
alphavirus structural protein and lipid envelope are derived from different alphavirus species. 

8. A pharmaceutical composition, comprising a recombinant alphavirus 
particle according to claim 6 or 7 and a pharmaceutically acceptable carrier or diluent. 

9. A host cell infected with a recombinant alphavirus particle according to 

claim 6 or 7. 

10. A togavirus capsid particle which contains substantially no genomic or 
RNA Vector Replicon nucleic acids. 

11. The capsid particle, according to claim 10, further comprising a lipid 
envelope containing one or more alphavirus glycoproteins. 

12. The capsid particle according to claim 10, further comprising an 
alphavirus envelope. 

13. The capsid particle according to claim 10 wherein said capsid is 
derived from a togavirus selected from the group consisting of alphaviruses, rotaviruses, 
flaviviruses and pestiviruses. 

14. A pharmaceutical composition, comprising a capsid particle according 
to any one of claims 10 to 13, and a pharmaceutically acceptable carrier or diluent. 

15. An alphavirus structural protein expression cassette, comprising a 5' 
promoter which initiates synthesis of RNA from DNA, a nucleic acid molecule which 
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Claims 

We claim: 

1. An isolated nucleic acid molecule, comprising an altered alphavirus 
nonstructural protein gene which, when operably incorporated into a recombinant alphavirus 
particle, increases the time required to reach 50% inhibition of host-cell directed 
macromolecular synthesis following expression in mammalian cells, as compared to a wild- 
type alphavirus. 

2. An isolated nucleic acid molecule, comprising an alphavirus 
nonstructural protein gene which, when operably incorporated into a recombinant alphavirus 
particle, has a reduced level of vector-specific RNA synthesis, as compared to the wild-type, 
and the same or greater level of proteins encoded by RNA transcribed from the viral junction 
region promoter, as compared to a wild-type recombinant alphavirus particle. 

3. An alphavirus vector construct, comprising a 5' promoter which 
initiates synthesis of viral RNA in vitro from cDNA, a 5' sequence which initiates 
transcription of alphavirus RNA, a nucleic acid molecule which operably encodes all four 
alphaviral nonstructural proteins including a nucleic acid molecule according to claims 1 or 2, 
an alphavirus RNA polymerase recognition sequence and a 3 f polyadenylate tract. 

4. An alphavirus RNA vector replicon capable of translation in a 
eukaryotic system, comprising a 5* sequence which initiates transcription of alphavirus RNA, 
a nucleic acid molecule which operably encodes all four alphaviral nonstructural proteins, 
including a nucleic acid molecule according to claims 1 or 2, an alphavirus RNA polymerase 
recognition sequence and a 3* polyadenylate tract. 

5. A pharmaceutical composition, comprising an alphavirus RNA vector 
replicon according to claim 4 and a pharmaceutical^ acceptable carrier or diluent. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 
ATATAGGATC CTTCGCATGA TTGAACAAGA TGGATTGC 
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Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Leu Lys Arg Arg Glu 
65 70 75 80 

Arg Met Cys Met Lys He Glu Asn 

85 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
1 5 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Leu 
50 55 60 

Lys Arg Arg Glu Arg Met Cys Met Lys He Glu Asn 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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15 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Val 
50 55 60 

Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Gin Lys Pro Lys Glu 
65 70 75 80 

Lys Lys Glu Asn Gin Lys Lys Lys Pro Thr Leu Lys Arg Arg Glu Arg 

85 90 95 

Met Cys Met Lys He Glu Asn 
100 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
1 5 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Val 
50 55 60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Met Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
15 10 15 

Arg Pro Ala Phe Arg Pro Trp Gin Val Ser Met Gin Pro Thr Pro Thr 
20 25 30 

Met Val Thr Pro Met Leu Gin Ala Pro Asp Leu Gin Ala Gin Gin Met 
35 40 45 

Gin Gin Leu He Ser Ala Val Ser Ala Leu Thr Thr Lys Gin Asn Val 
•50 55 60 

Lys Ala Pro Lys Gly Gin Arg Gin Lys Lys Gin Gin Lys Pro Lys Glu 
65 70 75 80 

Lys Lys Glu Asn Gin Lys Lys Lys Pro Thr Gin Lys Lys Lys Gin Gin 

85 90 95 

Gin Lys Pro Lys Pro Gin Ala Lys Lys Lys Lys Pro Gly Arg Arg Glu 
100 105 110 

Arg Met Cys Met Lys He Glu Asn 
115 120 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) 
Met 



SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Asn Tyr He Pro Thr Gin Thr Phe Tyr Gly Arg Arg Trp Arg Pro 
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(xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 111: 
TATATCTCGA GGGTOSTGIT GTAGTATTAG TCAG 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:112 
TATATGATAT CAAAAAGCCT GAACTCACCG CGACG 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:113 
ATATAGGATC CTCAGTTAGC CTCCCCCATC TCCCG 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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CAATCTTCCT CACGCCTTAG T 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TCCTAAATA6 TCAGCATAGT A 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TCCTAAATAG TCAGCATAGT T 21 
(2) FORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GTIjOGTTTGT CGTGCACTG A ' 21 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CMTCTTCCT CACQOCTTAG C 21 
(2) INFORMATION FOR SEQ ID NO: 108: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
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ATCCGACC/G CAAAACTCGA TGTAfJTCGG A9GAACTGAT GTGCATAATC CATCA3GCTG 11460 

CTACATTAGA TCCCOQCTTA CCQOQQGCM TATAQCAACA CTAAAAACTC GATCTAOTC 11520 

GGAQGAAGCG CAGTGCATAA H3CTOCQCAG TCTTGCCACA TAACCACTAT ATTAACCATT 11580 

TATCTAGCGG AOQCCAAAAA CTCMTCFAT TTCTGAQGAA QjGTOSTGCA TMTGCCACG 11640 

CAGCGTCTGC ATAACTTTTA TTATTTGTT TATTMTCAA CAAAATTTTG TTTTTAACAT 11700 

TTCAAAAAAA AAAAAAAAAA AWWWW\ /VWWWW 11740 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Ser He Leu Gly Ser Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTrjCGTTTGT CGTCCAACTG C 



21 
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OGTTCCAAA TCTOCCACAG ATACCGTATA AQlEACTTCT TCA4A3QGCA GGGTATCCCC 10140 

CQCTCMTTT GGAGATCACT GTCATGTCCT CGGOGGTTTT GCCTTCCACC AACCAAGAST 10200 

ACATTACCTG CA4ATTCACC ACTGTOGTOC CCTCCCCAAA AATCAAATGC 7QDQQCTCCT 10260 

TQGAATGTCA QCCGQCCQCT CATOCASACT ATACCIGCAA QSTCTTGGSA GSGGTCTACC 10320 

CCnTATGTG GGGflQGAGCG CAAIUIIIII GtmmGA GAACAGCCAG ATCAGTCOGG 10380 

CGfTAGGTGGA ATJGTCAQCA GATTQGQCGT CTGACCAQQC GCA33GGATT AASGTGCACA 10440 

CTOC0G0GAT GAAAGTAGGA CTOGGTATTG TGTAQGGGM CACTACCAGT TTCCTAGATG 10500 

TCTAGGTGAA CGGAGTCACA CCAQGAACGT CTAAAGCTT GAAAGTCATA QCTGGACCM 10560 

TTTCAGCATC GTITAOQCCA TTCGATCATA AGGTCGTTAT CCATCGCGGC CTGGTGTACA 10620 

AOATCACTT CCOGGMTAT GGAGCGATGA AACCAQGflQC GTTTGG^GAC ATTCAAQCTA 10680 

CCTCCmSAC TAGCAAGGAT CTCATGGCCA GCACAGACAT TAGGCTACIC AAGCCTTCOG 10740 

CCAAGAA0GT GCATGTCCGG TACAGGCAQG CCTCATCA9G ATTTGAGATC TQSAAAAACA 10800 

ACTCAGGCCG CjCCACTGCAG GAAACGGCAC GTTQQGGTG TAAGATTQCA GTAAATOjGC 10860 

TCOGAGGGGT GGACTGTTCA TACGGGAACA TTCCCATTTC TAT7GACATC CCGAAGGCTG 10920 

CCTTTATCAG GACATCAGAT GCACCAOGG TCTCAACAGT CAAATGTGAA G7CAGTCAGT 10980 

GCACTTATTC AGCAGACTTC GGCGGGATGG GCACGCTGCA GTATGTATCC GACCQCGAAG 11040 

GTCAATGCCC CGTACATTOG CATTCGAGCA CAGCAACTCT CCAAGAGTCG ACAGTACATC 11100 

TCCTGGAGAA AGGAGCGGTC ACAGTACACT TTAGCADCGC 6AGTCCACAG GGGAACTTTA 11160 

TGGTATGGCT GTCTGGGAAG AAGACAACAT GCAAIGCASA ATUTAAACCA CCAGCTGACC 11220 

ATATCGTGAG CAC0C0GCAC AAAAATGACC AAGAATTTCA AG0GGCCATC TCAAAAACAT 11280 

CATGGAGTTG GCTGTTTQCC CTTTTCQQCG GOQCCTCGTC GCTATTMTT ATAGGACTTA 11340 

TGATTTTTGC TTGCAGCATC ATQCTGACTA GVOC&fG ATGACCGCTA 0GCCCCAATG 11400 
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OGTOCTTCAG CCCTG1TAAG ATGGAGCAGG TCTOGGAOGA /)QgQG0O&\T AACACCATAC 8760 

QCAT/OGC TTCCGCCCflG TTFGGATACG ACCAAAGGQG AQC/^GCAAGC GCAAACAAGT 8820 

ACGGCTACAT CTOGCTTAAG CAGGATCACA CCGTTAAAGA AQGCACCATC GATGACATCA 8880 

AGATTAGCAC CTCA3GAC0G TCTAGAAGGC TTAGCTACAA AQGMACTTT CTCCTCQCAA 8940 

MTGCCCTCC AS3GGACA9C CTAACGGTTA GCATASTCAG TAGCAACTCA GCAACGTCAT 9000 

GTACACIGGC CGGCAAGATA AAACCAAAAT TOGTQQGAOG GGAAAWAT GATCTACCTC 9060 

COGTTCACGG TAAAAAAATT CCTTCCACAG 7GTACGACGG TCTCAAAGAA ACAACTQCAG 9120 

GCTACATCAC TATGCACAQG COGASACCGC ACQCTTATAC ATCCTACCTC (WGAATCAT 9180 

CAGGGAAAGT TTAQGCAAAG CCQCCATCTC GGAAGAACAT TAOGTATGAG TGCAAGTGCG 9240 

GGGACTACAA GACCGGAAGC GTTRjGACCC GCADOGAAAT CACTGGTTGC ACOGCCATCA 9300 

AGCASTCOGT CGCCTATAAG AGOGACCAAA GGAAGTGQGT CTTCACTCA CCGSACITGA 9360 

TCAGACATCA OGACCACAOG OECCAAGGGA MTTGCATTT GCCTTTCAAG TTGATCCCGA 9420 

GTACCTGCAT GSTCCCTGTT GCCCACGCGC CGAATCTAAT ACATQQCTTT AAACACATCA 9480 

GCCTCCAATT AGATACAGAC CACTTCACAT TCCTCACCAC (7CGAGACTA QQQGCAAACC 9540 

GGGAACCAAC CACTGAATGG ATCGTIjQ^A AGAOGGTCAS AAAC1TCACC GTGGACGGAG 9600 

ATQXCTOGA ATACATATOG GGAAATCATC AGCCAGTGAG QGTCTATQCC CAAGAGTCAG 9660 

CACCASGASA CCCTCAGQGA TQGCCACACG AAATAGTACA GCA1TACTAC CATCQCCATC 9720 

OGICTACAC CATCTTAQCC GTCGCATCAG CTACOGTOGC GATGATCATT Q3CGTAACTG 9780 

TTGCAGTGTT ATGIGCCTCT AAAGCGOGCC GTCAGTGOCT GAQGCCATAC GCCCTOGCCC 9840 

CAAACGOOGT AATCGCAACT T0GCTQ3CAC TCTTCTGCTC CGTTAGGTCG GCCAATGCTG 9900 

AAAOGTTCAC CGAGAOCATC ASTTACTTCr GGTOGAACAG TCAGCCGTTC TTCTGGGTCC 9960 

AGTRJIQCAT ACCTTTOGCC GCTTTCATCG TTCTMTQQG CTGCTGCTCC TGCTGCCTCC 10020 

CTTTTTTAGT GGTTODOGGC GCCTACCTGG CGAAGGTAGA GQCCTAGGAA CATGGGAGCA 10080 
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A02CGAGW AGAOGAAGAC AGAAGAGGCG CTCTOCTASA TGAAACAAAG GOGTGGTTTA 7440 

GAGTA9GTAT AACAGGCACT TTAGCAGTCG CGGTCAOGAC COOGiTATGAG GTAGACAATA 7500 

TTACACCTGT CCTACTQQCA TTGAGAACTT TTCCCCAGAG CAAAAGAQCA TTCCAA3CCA 7560 

TCASAGGQGA AATAAAQCAT CTCTAOGSTG GTCCTAAATA GTCAGCATAG TACATTTCAT 7620 

CTCACTMTA CTACAACAGC ACCACCATGA ATAGAGGATT CTTTAACA7G CTOGGCOGCC 7680 

GCCCGTCCC GGCCCCCACT GCCATGTGGA GGCCQGGGAG AAGGAGGCAG GCGQCCGOGA 7740 

TGCCTGCCCG CAACGGGCTG GCTTCTCAM TCCAGOVCT GACCACAGCC GTCAGTQCCC 7800 

TAGTCATTQG AC/CGCAACT AGACCTCAAC CCCCAOGTCC ACGCCCGCCA CCGGGCCAGA 7860 

AGAAGCAGGC GCCCAAGCAA CCACOGAAGC OGAAGAAACC AAAAAOGCAG GAGAAGAAGA 7920 

AGAAGCAACC TGCAAAACCC AAACCCQGAA AGAGACAGOG CATGGCACTT AAGTTGGAGG 7980 

CCGACAGATT GTTOGACGTC AAGAACGAGG AGGGAGATCT CATGGQGCAC QCACTOQCCA 8040 

TOGAA9GAAA GGTAATCAAA CCTCTGCACG TGAAA9SAAC CATOGACCAC CCTGTGCTAT 8100 

CAAAGCTCAA ATTTACCAAG TOGTCAGCAT AGGACATOGA GTTOQCACAG TTGCCAGTCA 8160 

ACATGAGAAG TG4GGCATTC ACCTACACCA GTCAACACCC GGAAGGATTC TATAACIGGC 8220 

ACCAGGGAGC GG1GCAGTAT AGTGGAQGTA GATTTACCAT CCCTGGGGGA GTAGGAGGCA 8280 

GAGGAGACAG CGGTGGTCCG ATCA7GGATA ACTCCGGTGG QGTTGTCGQG ATAGTCCTCG 8340 

GTCGGGCTGA TGAAGGAACA OGAACTGCCC TTTOGGTCGT CACCTOGAAT AGTAAAG9GA 8400 

AGACAATTM GAGGACCCGG GAAGQGACAG AAGAGTQGTC OGCAGCACCA CTOGTCAGGG 8460 

GAAIGIGIII GCTGGGAAAT GTGAGCTTGC CATQOGACCG GCCGCCCACA TQCTATACCC 8520 

GCGAACCTTC CAGAGCCCTC GACATCCTTC AAGAGAACGT GAACCATGAG GCCTAGGATA 8580 

CCCTGCTCAA TQCCATATTG CGG1GGGGAT G57O0GCAG AAGCAAAAGA AGGGTCATTG 8640 

AGGACTTTAC CCTGACCAGC CCCTACTTGG QCACATOCTC GTACTOCCAC CATACTCTAC 8700 
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AGOGAOACT GTCAQGCTA OGACTGTATA ACTOQCCAC AGOTCAGCCA GAATCCTATA 6060 

AGATCACCTA TCCGAAACCA T7GTACTCCA CTA30GTACC Q30GAACTAC TCCGATCCAC 6120 

AGTTGGCTGT A3UGIUGI AACAACTATC TCCATGflGAA CTATCGGACA GTAGCATCTT 6180 

ATCAGATTAC TCAG3AGTAC GMQCTTACT 7GS4TATGGT GTOGCCTGCC 6240 

TQGATACTGC AACCJTCTQC CCCGCTAAQC TTAGAAGTTA CCCGAAAAAA CATGAGTATA 6300 

GAGCCCOGAA TATCOGCAGT GGGGTTCCAT CAGCGAJGCA GAACACGCTA CAAAA7GTGC 6360 

TCATTCCOGC AACTAAAAGA AATIQCAACG TCAOGCASAT GCGTGAACTG CCAACACTGG 6420 

ACTCAGGGAC A7TCMTGTC GAAT6CTTTC GAAAATATCC ATCTAATCAC GA6TATTQQS 6480 

AGGAGTTGGC T0GGAA9CCA ATTAGGATTA CCACTCAGTT TGTCACCGCA TATGTAGCTA 6640 

GACTGAAAGG CCCTAAGGCC GCCQCACTAT TTCCAAAGAC GTATAATTTG GTCCCATTGC 6600 

AAGAASTQCC TATGGATAGA TTCGTCATGG ACATGAAAAG AGAOGTGAAA GTTACACCAG 6660 

GCAGGAAACA CACAGAAGAA AGACGGAAAG TACAAGTGAT ACAAGGGGCA GAACCXTGG 6720 

CG03C1TA CTTATGOGGG ATTCACOGQG AATTAGTGCG TAGGCTTACG GCGGTOTGC 6780 

TTCCAAACAT TCACAGGCTT TTTCACATGT CGGCGGAGGA TTTTCATGCA ATCATAQCAG 6840 

AACACTTCAA GCAAQGOGAC CCGGTACTGG AGACGGATAT OGCATCATTC GACAAAAGGC 6900 

AAGAOGAOGC TATGGCGTTA ACCGGTCTGA TGATCTTGGA G3ACCTGGGT GTCGATCAAC 6960 

CACTACTCGA CTTGATOGAG TGOQCGTTG 6AGAAATATC ATCCACCCAT CTACCTAOGG 7020 

GTACTCGTTT TAAATTOGGG GGGATGATCA MTCOGGAAT GTTCCTCACA UIIIIGIC A 7080 

ACACAGTITT GAAIGTGG7T ATOQCCAGCA GAGTACTAGA AGAQOGGCTT AAAACGTCCA 7140 

GATGTGCAQC GTTCATTGGC GAGGACAACA TCATACATQG AGTAGTATCT GACAAAGAAA 7200 

THTTGAGAG GTCOGCCAOC TGGCTCAACA TQGA9GTTM GMCATCGAC GCAGTCATCG 7260 

G7GAGAGACC ACCTTACTTC TOOGQOGGAT TTATCTTCCA AGATTGGGTT ACTTCCACAG 7320 

OGTGCOGCGT GQGGGATCCC CTGAAAAGGC TGTTTAAGTT GGGTAAACOG CTCCCAGCCG 7380 
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CAAAAGGAAA ATTGTATTCG TACTTOGAAG QCACCAAATT 
TGSGGGASAT AAAQGTCCTC TTCCCTMTC ACCASGAAAG 
ACATATTGGG TCAGACCATG GAASCAATOC G3GAAAAGTC 
OCTCTAGCCC GCCCAAAACG TTGCGGTOX TTTGCATGTA 
TCCACAGCT TAGAA3CAAT AAGGTCAAAG AAGTTACAGT 
CTAAGCACAA AATTAAGAAT GTTCAGAAGG TTC/OQCAC 
OQCACACTCC CQCATTOGTT CQDGCCOCTA AGTACATAGA 
CTCCTCCTGC ACAGGCOGAG GA9QCCCCCG AAGTTGTAGC 
CTGATAACAC CTOQCTTGAT GTCACAGACA TOCACTGGA 
QCTCACTTTT TTOGflGCTTT AGCGGATGGG ACAACTCTAT 
CGTCASSACC TAGTTCACTA GAGATAGTAG ACCGAAGGCA 
ATOCCGTCCA AGAGCCTGCC OCTATTCCAC GGCCAAGGCT 
CAGOQGCAAG AAAAGAGCCC ACTCCACCGG CAAGCAATAG 
CTTTTQGTGG GGTATCCATG TCCCTOGGAT CAATTTTGGA 
CAGCGGTACA ACOCCTGGCA ACAGGCCCCA GGGATCTGCC 
CGGAGGGAGA GATTGATGAG OGAGCOGCA GAGTAAC1GA 
GATCATTTGA ACOQGGGGAA GTCAACTCAA TTATATOGTC 
CACTADGCAA GCAGAGACGT AGAGGCAGGA GCAGGAGGAC 
TAGG1GGGTA CATATTTTOG AGGGACACAG GCCCTGGGCA 
TOCAGAACCA GCTTACAGAA CCGACCTTGG AGOGCAATCT 
OGGTGCTOGA CAGGTOGAAA GAGGAACAAC TCAAACTCAG 
AAGCCAACAA AAGTAGGTAC CAG7UTCGTA AAGTAGAAAA 
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AGflGCATCCC ACTAAGGTAC (MXCGCOG ATTCA3CGAG GCCGGTAGCT CATIQGGACA 3360 

ACAQCCCflffi AACCOGCAAG TATQ3STA0G ATCAOGCCAT TGCGGCGGAA CTCTCCCGTA 3420 

GATTTCOGGT GTTCCAGCTA (CT3QGAAGG GCACACAACT TCATTTCSCAG ACGGQGAS4A 3480 

CCAGaGTTAT CTCIQCACAG CATAACC7GS TDCOQGTO^ CCGCAATCTT CCTCACQCCT 3540 

TAGTCCCGSA GTACAAGGAG AAGCAACCOG GCCOQCTCAA AAAATTGTG AACCAGTTCA 3600 

AACACCACTC AGTACTOG GTATCAGflQG AAAAAATTG4 AGCTCCCGGT AAGAGAATCG 3660 

AATGSATCGC aTG4TJGGC ATAQCCGGTG CASATAflGAA CTACAACCTG QCTTTCQGGT 3720 

TTCCGCGGCA QGCACQGTAC GACCTBGTCT TCATCAACAT TGGVCTAAA TACAGAAACC 3780 

ACCACTtTCA GCAGTGOGAA GACCATG0Q3 GGAOCTTAAA AACCCTTTCG GGTTCGGCCC 3840 

TGAATTGCCT TAACCCAGGA GSCAGCCT0G TCGTCAASTC CTATQQCTAC QCOGAO0QCA 3900 

ACAG1GAQSA OGTAGTCACC GCTCTTGCCA GAAAGTTTGT CAGQG7GTCT QCAGGGfiGAC 3960 

cagattgtgt ctcaaqcmt acagaaatct aqctgatttt (mam a gacaaqsgcc 4020 

GTACAQGGCA ATTCA00C0G CACCATCTGA ATIBDGTGAT TTGGTCGGTG TATGAQQGTA 4080 

CAAGAGATQ3 AGTTOGflGCC GCGCCGTCAT ACGQCACCAA AAQQGAGAAT ATTGCTGACT 4140 

GTCAAGAGCA AGCA6TTGTC AAQGCAGCCA ATC0GCTQQG TAGCCAQQC GAAGGAGTG" 4200 

GC0GTCCO4T CTATAAAOGT TGGCCGACGA GTTTTAOOGA TTCASCCACG GASACAQQCA 4260 

CCGCAAGAAT GACTGTGTGC CTAQGAAAGA AAGTGATCCA CGGQGTOQQC CCTGATTTCC 4320 

GGAAGCACCC AGAAGCAGAA GCCTTGAMT TCCTACAAAA CQCCTAOCAT GCAGTGQCAG 4380 

ACTTAGTAM TGAACATAAC ATCAA6TOG TCGOCATTCC ACTGCTATCT ACAGGCATTT 4440 

ACGCA5C0GG AAAAGACCGC CTTGAAGTAT CACTTAACTG OT&ACAACC GCGCTAGACA 4500 

GAACTGADGC GSADGTAADC ATCTATTCCC TGSATAAGAA GI6GAAGGAA AGAATCGACG 4560 

CGGCACTCCA AGTAAGGAG TCIGTAACAG PCCmm TGAASATATG GASAT0GAG3 4620 

ATCAGTTAGT ATOGATCCAT CCAGACAGn GCTTWQQG AiGWGQGA TTCAGTACTA 4680 
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GCAAACTATA GCACATTCCC ATGCA7GQCC CGGCCAAGAA TfOCM&G GAGCAGTACA 2040 

AGGTTACAAA GQCAGAGCTT GCA6AAACAG AGTAGGTGTT TGADGTOGAC AAGAAQCGTT 2100 

GOGTTAAGAA QGWMC TCAGGfTUGG TCCTCTCQGG /WCWC AACCCTOjCT 2160 

ATCATCflGCT /SGCTOGGflG QSCTGWA CCCGACCTQC GGTCCGGTAC AAGGTCGAAA 2220 

CAATAGGAGT GATAQQCACA GOEQGT0GG GCAA3TCAGC TA7TATCAAG TCAAC1GTCA 2280 

CQaCACGflG^ TCTTCTTACC AQCGGAAA6A AAGAAAAT7G TG9CGAAATT G^QGCOGCG 2340 

TOCTAAGACT GAGGGGTATC CAGATTAOGT CGAAGACA5F AGATTCGGTT ATOCTCAAOG 2400 

GATCCCACAA AQCCGTAGAA G7GCIGTAGG TT^OGAAQC GTTOQCGTOC CACGCAGGAG 2460 

CACTACTTOC OTCATTGCT ATGGTCAGGC CCCQCAAGAA GGTAGTACTA TO^GACC 2520 

CCATGCAATG (BGATIUTTC AACAIOTC AACTAAAGGT ACATTTCAAT CACCCIGAAA 2580 

AAGACATATG CAGCAAGACA TTCTACAAGT ATATCTCCCG GCGTTCCACA CAGCCAGTTA 2640 

CAGCTATTGT A7CGCACTG CATTACGATG GAAAGATGAA AACCACGAAC CCGTGCAAGA 2700 

AGAACATTCA AATCGATATT ACAGQGGCCA CAAAQCOGAA GCCAGGGGAT ATCATCCTGA 2760 

CATGTTTCOG CGQGTGBGTT A^GCAATTQC AAATCGACTA TCCCGGACAT GAAGTAATGA 2820 

CAGCCGOQQC CTCACAAGQG CTAACCAGAA AAGGAGTGTA TGCOGTCGQG CAAAAAGTCA 2880 

ATGAAAACCC ACTGTACGCG ATCACATCAG AGCAIGTGAA CGTGTTGCIC ACC03CACTC 2940 

AGGACAGGCT AGTCTGGAAA ACCTTQCAGG GDGAGCCA7G GATTAAGCAG CCCACTAACA 3000 

TACCTAAAGS AAAGTTCAG GCTACTATAG AGGACTGGGA AGOGAAOC AAGQGMTAA 3060 

TTQCTQCAAT AAACAGCCCC ACTCCCCGTC CCAATCCGTT CAGCTGCAAG ACCAACGTTT 3120 

GOQQQOGAA AGCATTOGM CCGATACTAG CCACGQCCQG TATDGTACTT ACCGGTTBCC 3180 

AGTGGAGCGA ACTGTTCCCA CAGTTTOGGG ATCACAAACC ACATTOGQOC ATTTAOGOCT 3240 

TAGAGGTAAT TTGCATTAAG TTTTTCGGCA TGGACTTGAC AAGCGGCTG TTTTCTAAAC 3300 
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amum catcttctcg gctatggcag gttcgtaccc tgcctacaac accaactoqg 660 

COGACGflGAA AGTCCT7GM GCGCGTAACA TOQGCTTTC (2GCACAAAG GGAGTCAAG 720 

CTAGG4CAGG AAAATTCTOG ATMTGAGGA AGAAGGAGTT G4AGCCCGQG TCGCGGGITT 780 

ATTTCTCCGT AGGA70GACA CTTTATCCAG AA(X2GAGC CAGCTTQCAG AGCIQQCATC 840 

TTCCATGGCT GTTCCACTTG AATQGftAAQC AGTOCTACAC TTGCCQCTGT GATACAGTOG 900 

TCAGTTG0GA AQXTA0GTA GTCftAGWA TCACCATCAG TCCGGSGATC ACQGGAGAM 960 

C0CTG3G4TA OGCGGTTACA CACAATAQ06 AQQQCTTCTT QCTATQCAAA GTTACTCACA 1020 

CAGTAAAAGG AGAACGQ3TA TCGTTCCCTG TCTOCAGGTA CATCCCGGCC ACCATATOGG 1080 

ATCAGATGAC TGGTATAA7G GGCAGGG4TA TATCACCT6A OGATQCACAA AAACTTUGG 1140 

TTQQGCTCAA CCAGCGAATT GTCATTAAQG GTAGGCTAA CAGGAACACC AACACCATQC 1200 

AAAATTACCT TCTGC0GATC ATAQCACAAG GGTTCAGCAA ATGGGCTAAG 6AGCQCAAQG 1260 

ATGA7UT7GA TAAQGAGAAA ATOT3GTA CTAGAGAAOG CAAGCTTACG TATQGCTQCT 1320 

"OQBGCGTT TCGCACTAAG AAAGTACATT QGTTTTATCG CCCACCTGGA ACGCAGACCT 1380 

GCGTAAAAGT CCCAGDCTCT T7TAG0GCTT TTCCCATGTC GTCOGTATQG ACGACCTCTT 1440 

TOG0CA7GTC GCTGASGCAG AAAT7GAAAC 7GXATTGCA ACCAAAGAAG GAQ3AAAAAC 1500 

TCCTGCASST CTCGGAQGAA TTAGTCA1T3G AQQCCAAGGC TGCTTTTGAG GATGCTCAGG 1560 

AGGAAGCCAG AGOGSAGAAG CTCGGAGAAG CACTTCCACC ATTAGTGGCA GACAAAGGCA 1620 

TCGAGGCAGC CQCAGAAGTT GTCTC0GAAG TQ3ASSS3CT CCAGSGGGAC ATCQGAQCAG 1680 

CATTAGTTCA AACCCCQCGC GSTCAOGTM GGATMTACC TCAAGCAAAT GACCGTATGA 1740 

TCQGACAGTA TATGGT7GTC TCGCCAAACT C1GTGCTGAA GAATOCCAAA CTDGCAOCAG 1800 

GGCACCCQCT AGCAGATCAG GTTAAGATCA TAACACACFC GGGAAGATCA GGAAGGTAGG 1860 

CGGTCGAflDC ATACGAOGCT AAAGTACTCA TGCCAGCAGG AGGTGCOGTA OCATOBOCAG 1920 

AATTCCTAQC ACIGAGTCAG AGCGGCACGT TAGTCTACAA CGAAAGAGAG TtTGTGAAGC 1980 



WO 97/38087 



PCT/US97/W010 



239 

TQOCTODCOG CACQGGCTC GOTCTCAAA TCCAGCAACT WXWGL CTCflGTQCCC 7800 
TAGTCATTGG ACASGCACT AGACCTCAAC CCCCADGTOC AOXICGCCA CCQCGCCASA 7860 

/mows: (Xccaagcaa. cdccgaagc (eaasaaacc aaaaacgcag gwgwa 7920 

AGAAGCAACC TBCAAAACCC AAACCCGGAA AGAGACA30G CAT3QCACTT AflGTTQSAQG 7980 
CCCVCASA.TT GTTOGfiCCTC .8000 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIM5S: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

ATTGACQQCG TACTACACAC TATTGAATCA AACAGCCGAC CAATTBCACT ACCATCACAA 60 

TOGAGAAQCC AGTAGTAAAC CTAG0CGTAG ACCCCCAGAG TCCGTTTCTC GTGCAACTCC 120 

AAAAAAQCTT CCGGCAATTT GAffiTAGTAG CACAGCA3GT CACTCCAAAT GCCATCCTA 180 

ATCCCAGAGC ATTTTD3CAT CTGGCCAGTA AACTMTCGA GCTQGAGGTT CCTACCACAG 240 

CGACGATCTT QGACATAGQC AG03CAC03G CTCGTASAAT GTTTTCCGAG CAOCAGTATC 300 

ATTCTCTCTG CCCCATQCCT AGTCCAGAAG ACCCGGACCG CAH2ATCAM TACGCCAGTA 360 

AACTCQCQGA AAAAG03TCC AAGATTACM ACAAGAACTT QCATGAWG ATTAAGGATC 420 

TC03GAC03T ACTTWACG CGGGATGCTC AAACACCATC GCTCTQCTTT CACAACGATC 480 

TTACCTQCAA CATGCGTQCC GAATATTCCG TCATGCAGGA GGTCTATATC AACQCTCCCG 540 

GAACTATCTA TCATCAGGCT ATCAAAGGCG TQCQGACCCT GTACTGGATT QGOTCGACA 600 
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TCATTQCCQC AACTAAAAGA MTTGCAACG TCA0GC4GAT QCGfTG¥CTC CCAACACTOG 6420 

ACTCAGCGAC ATTCAATGTC 6AATQCTTTC GAAAATATCC ATCTMTCAC GflCTATTCQG 6480 

AGGAGTTCGC TCGGAAGCCA ATTAGGATTA CCACIGAGTT TGTCACCGCA TATGTAGCTA 6540 

GACJGWGG CCCTAAQQCC GCCGCACTAT GTATMITTG GTCCCAT7GC 6600 

AAGAAGTCCC TA7GGATAGA TTOGTCATOG ACATCAAAAG AGflCGTGAAA GTTACACCAG 6660 

GCAGGWCA CACAGAAGAA AGACOGAAAG TACAAGTCAT ACAAQtXQCA 6WCCCTQG 6720 

OGACTQCTTA CTTATGOGQG ATTCAC0Q3G MTTAGTCGG TAGQCTTACG GCCGTCTTGC 6780 

TTCCAAACAT TCACA0GCTT TTTCACATGT CGGCGGAQGA TTTTGATGCA ATCATAGCAG 6840 

AACACTTCAA QCAAQQ0GAC GCGGTACTGG AGAGGGATAT OGCATCATTC GACAAAAGCC 6900 

AAGACGACX TATGGOGTTA ACCGGTCTGA TGATCTTGG4 GGACCTOOGT GTOGATCAAC 6960 

CACTACTCGA CTTCflJOGAG TGCGCCTTTG GAGAAATATC ATCCACCCAT CTACCTACGG 7020 

GTACTOGTTT TAMTTOGGG GGGATCATCA MTCOGGAAT GTTCCTCACA CTTTTTGTCA 7080 

ACACAGTTTT GAATGTOGTT ATCG0CAGCA GflGTACTAGA AGAGOGGCTT AAAA0GTCCA 7140 

GATGTGCAGC GTTCATTGGC GAGGACAACA TCATACATGG AGTAGTATCT GACAAAGAAA 7200 

TGGCTCAGAG GTGCGCCACC TQXTCAACA TOSAGGTTM GATCATCGAC GCAGTCATCG 7260 

GTCflGASACC ACCTTAOTC TCGGGOGGAT TTATCTTQCA AGATTCGGTT ACTTCCACAG 7320 

CGTGCCGGGT GGCGGATCCC CTGAAAAGGC TGTTTAAGTT GGGTAAACCG CTCCCAGCGG 7380 

AO^OGAGCA AGAGGAAGAC AGAAGACGGG CTC7GCTAGA TGAAACAAAG GOGTGGTTTA 7440 

GAGTAGGTAT AACAQGCACT TTAGCAGTOG CGGTGAGGAC GOGGTATCAG GTAGACAATA 7500 

TTACAGCTGT CCTACTQ3CA TTGAGAACTT TTGCCCAGAG CAAAAGAGCA TTCCAAGCCA 7560 

TCAGAQQQGA AATAAAGCAT CTCTAGGGTG GTCCTAAATA GTCAGCATAG TACATTTCAT 7620 

C1GACTMTA CTACAACACC AGCACCATCA ATAGAQGATT CTTTAACATG CTCGGCGGCC 7680 

GCCCCTTOCC GGGCCOCACT QCCATGTGGA GGCGGGGGAG AAQG fl Q3CAG GGGGGGCCGA 7740 
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GGCAC/OCC GGCATTGGTT GCOGCCGGTA AGTACATAG4 /OQCCAGflA CAQOCTADOG 5100 

CTCCTCCTGC ACAGGCCGAG GA990CCG0G AAGTTGTAGC GACAGOGTCA CCATCTACAG 5160 

CTGATAACAC CTGGCTTCAT GTCACAGACA TCTCACTGG<\ TATOGATGAC AGTAGGGAAG 5220 

GCTCACTTTT TTOGAGCTTT AGGGGATCGG ACAACTCTAT TACTAGTATG GACAGT7GGT 5280 

CGTCAQGACC TAGTTCACTA GAG4TAGTAG ACCGAAQGCA GGTGGTOSTG GOGflCGHTC 5340 

ATGCCGTCCA AGAGCCTGCC CCTATTCCAC CGCCAAQQCT AAASAAGA7G GCC0GCCTQ3 5400 

CAQOGQCAAG AAAAGAQCCC ACTCCACCQG CAAGCAATAG CTCT6AGTCC OCCACCTCT 5460 

CTTTTGGTOG GGTATCCATG TCCCTCGGAT CAATTTTOGA GGGAGAGACG GCCGQCCAGG 5520 

CAGGGGTACA ACCCCTOGCA ACAGGOCCCA GGGATCTOCC TATGTCTTTC GGATGGTTTT , 5580 

CCGA0GGAGA GATTGAH^G OGAG00GCA GAGTAACTGA GTCCGAACCC GTC LIGI 1 1G 5640 

GATCATTTGA ACGGGGOGAA GTGAACTCAA TTATATGGTC CGGATCAGCC GTATCTTTTC 5700 

CACTAGGCM GCAGAGACGT AGAGGCAGGA GCAGGAGGAC TGAATACTGA CTAACCGQGG 5760 

TAGGTGGGTA CATATTTTGG AOGGACACAG GCCCTOQQCA CTTGCAAAAG AAGTCGGTTC 5820 

TQCAGAACCA GCTTACAGAA COGAGCTTGG AGOGCMTGT CCTGGAAAGA A1TCATGCCC 5880 

CGGTQCTOGA CAGGTCGAAA GAGGAACAAC TCAAACTCAG GTACCAGATC ATQCCCACOG 5940 

AAGCCAACAA AAGTAGGTAC CAGTCTCGTA AASTASAAAA 7CAGAAAGCC ATAACCACTG 6000 

AG0GACTACT GTCAGGACTA GGAC7GTATA ACTUGCCAC AGATCAGCCA GAATCCTATA 6060 

AGATCACCTA TGGGAAAOCA TTGTACTCCA GTAGCGTACC GGCGAACTAC TCCGATCCAC 6120 

AGTTOGCTGT AGCIGTU1GT AACAACTATC TOCATGAGAA CTATCGGACA GTAGCATCTT 6180 

ATCAGATTAC TGACGAGTAC GATGCTTACT TGGATATGGT AGACGGGACA GTCGCCTGCC 6240 

TGGATACTGC AACCTTOSC GCOGCTAAGC TTAGAAGTTA CCCGAAAAAA CATGAGTATA 6300 

GAGGCCCGAA TATCCQCAGT GCGGTTCCAT CAGOGATGCA 6AACAGGCTA CAAAATGTGC 6360 
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AATQGATCGC CCOGATTQGC ATAGCOGGTG (MTAP\ CTACACCTC GCTTT0QQG1T 3720 

TTCCGCCGCA CDXGGTAC GACCFBGTGT TCATCAACAT TOMTAAA TACAGAAACC 3780 

KIACTTTCA GCAGTCCGAA GACCATCOQG (MXTAM AACCCTTTCG CGTTOQGOCC 3840 

TCAATTCCCT TACCCAGGA GGCACCCTCG TGGIGAAGTC CTATQGCTAC GCGGAXGCA 3900 

AOCTSAGGA (CTAGTCACC GCTCTTCCCA GAAAGTT7GT CAGGGIGTCT GOGGGAGAC 3960 

CAGATTGTGT CTCAAGCAAT /OCAAATCT ACCTCATTTT CCGACAACTA GACAACAGGC 4020 

GTACA0GGCA ATTCACCCGG CACCATCTCA ATTCCGTCAT ITCGTCCGTG TATGAGGGTA 4080 

CAAGflGATQG AGmaGflQCC QCGC0GTCAT ACOQCACCAA AflGGGAGAAT ATTQCTGCT 4140 

CTCAA6AGGA AGCAGTTGTC AACGCAGCCA ATCCGCTQQG TAGACCA9X GAAQGAGICT 4200 

GCCGTOCCAT CTATAAA0GT H3QCCGACCA GTfTTACOGA TTCAGOCACG GAGACAGGCA 4260 

C03CAAGAA.T GAC7GTGTOC CTAGGAAAGA AAGTGATCCA CGCGGTGGGC CCTGATTTCC 4320 

QSAAQCACOC AGAAGCAGAA QCCTTGAAAT TGCTACAAAA CQCCTACCAT GCAGTCGCAG 4380 

ACTTAGTAAA TGAACATAAC ATCAAGTCTC TCGCCATTCC ACTGCTATCT ACAQQCATTT 4440 

AOGCAGCOQG AAAflGACCQC CTTGAAGTAT CACTTAACTC CTTGACAACC GOGCTAGACA 4500 

GAACTGflCGC GGAOGTAACC ATCTATTGCC TQGATAAGAA GTGGAAGGAA AGAATCGAGG 4560 

GGGCACTCCA ACTTAAQGAG TCTGTAACAG A3C7GAAQGA TGAAGATATG GAGATCGACG 4620 

ATGAGTTAGT ATQGATCCAT CCAGACAGTT GCTTCM33G AAGAAAQQGA TTGAGTACTA 4680 

CAAAAGGAAA ATTGTATTOG TACTTOGAAG GCACCAAATT CCATCAAGCA GCAAAAGACA 4740 

TOGGGSAGAT AAAQGTCCTC TTCCCTMTG ACCAGGAAAG TAATGAACAA CIGTGTGCCT 4800 

ACATATTGGG TGAGACCATG GAAGCAATCC GOGAAAAGTC CGGGGTOGAC CATAACCCGT 4860 

GCTCTAGCCC GOXAAAADG T7GCCGTGCC TTTGCATCTA TGGCATGACG CCAGAAAGGG 4920 

TCCACAGACT TAGAAGCAAT ACGTCAAAG AAGTTACAGT ATQCTCCTCC ACCCCCCTTC 4980 

CTAAGCACAA MTTAAGAAT GTTCAGAAGG TTCAGTGCAC GAAAGTAGTC CTGTTTAATC 5040 
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vnma g*ggggtatc cagattacgt ggaascast /wtcggtt atgctcms 2400 

GAIGCGCAA /*GOGGTAGAA GTGCTGTACG T7&A0G4A3C GTTGGCGTCC CAOGCAQGOG 2460 

CACTCnOG CTTCATTGCT ATCGTCAGGC CCCGCAAGV\ QGBGT/OA TGOQGflGAOC 2520 

CCATCCMTC GGGATTCTTC ACATGMGC AACTAAAQGT ACATTTCAAT C/CCCTGAAA 2580 

AAGflCATATG UONOCA TTCT/CAAGT ATATOCCGG GOGTTOCACA OOjCAGTTA 2640 

CAGCTATTGT ATGGACACIG CATTACGATG GAAffiATCAA ACCACGAC CCGTGCAAGA 2700 

AGA4CATTGA MTOGATATT /OG333CCA CAAAGCOGAA QCC/COGGAT ATCATCCTGA 2760 

CATGTTTCCG GGGGTGGSTT AflGCAATTGC AAATGGCTA TCC0Q(XAT GMGTAATCA 2820 

OTCCGOGGC CKXM3QG CTAACCAGAA A^GTCTA TCCCGTCCQ3 CAAAAAGICA 2880 

ATGAAAACCC OT/CQCG ATCACATCAG AGCATCTGAA OGTUTTCCTC ACCCGCACTG 2940 

tmOW /CTGTCGAAA /OT7GCAG3 QCGCCCATG GAJTAAGCAG CCCCTAACA 3000 

TA&TAAAGG AAACTTTCAG GCTACTATAG AGGACTGGSA f&JGAfCAC AAGQGAATAA 3060 

TIGCTGCAAT AAACAGCCGC /CTCCCGGTG CCAATCCGTT CAGCTGCAAG ACCAAOGTTT 3120 

GCTQGG0GAA A3CATTQSAA (XOTCTAG CCACGGCGGG TATOGTACTT /COGGTTGCC 3180 

AGTGGflGCGA /CTGTTCCCA CAGTTTCGGG AIGACAAACC ACATTOGGCC ATTTAGGCCT 3240 

T/GflOGTAAT TTGCATTAAG TTTTTCQGCA TGSCTTC^ AAGCGGACTG TTTTCTAAAC 3300 

AGAGCATCCC /HAACGTAC CATDC0QCCG ATTCAQCGflG GCCGGTAGCT CATTG9GCA 3360 

ACAGCCCAGG AACOCGCAAG TAIGGGTACG ATCACGODAT TGCCGOCGAA CTCTCC0GTA 3420 

GATTTCCQGT GTTOCAGCTA GCTGGGAAGG GCACACMU TGATTTGCAj A0GQGGAGAA 3480 

OCAGflGTTAT CTCTGCACAG CATAAGCTGG 7CC0GGTGAA CCGCAATGT CCTCACGOCT 3540 

IflGTCOCOGA GTACAAGGAG NWWXG GCCCGGTCAA AAAATTCTTG AACCAGTTCA 3600 

PKKCACTC AGTACTTCTG GTATCAGAGG AAAAAATO AGCTCGCGGT AAGAGAATOG 3660 
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cammA gsoggttaca (xaatagcg xxmrcn .qctaigwm gttactgaca 1020 

CAGTAAAA3G AGAAGGGGTA TCGITCCCTC TGTOCACGTA CATCCCQQCC ACCATA7GC6 1080 

ATCASA7GAC TQGTATM7G GCCAGGGATA TATGACCTGA GGAJGCACAA AAACTTUT13G 1140 

TTGGGCTCM CCAQGGAATT GTCATTAACG GTA9GACTM CAGGAACACC AACACCATCC 1200 

AAAATTACCT TCTOCCGATC ATAGCACAAG GGTTCAGCAA ATGQGCTAAG GAGCGCAAGG 1260 

ATGATC17G<\ TAAGGAGAAA AIGCTGQGTA CTAGAGAAGS CAAGCTTAOG TATGGCTC5CT 1320 

TCTCQGOGHT TCGCACTAAG AAAGTACATT CGTTTTATGG CCCACCTQGA ACQCAGACCT 1380 

GCGTAAAAGT CCCAGCCTCT TTTAGCQCTT TTCCCATGTC GTC0GTATGS AGGACCTCTT 1440 

TCCCCAIGTC GCTBAQGCAG AAATTGAAAC TO3CATTGCA ACCAAAG4AG G^GGAAAAAC 1500 

TCCTGCAGGT CTCQGAQGAA TTAGTCATOG AGGCCAAGGC TGCTTTTGAG GATGCTCAGG 1560 

AGGAAGCCAG AQCGGAGAAG CTCCGAGAAG CAC7TCCACC ATTAGTCGCA GACAAAQGCA 1620 

TCGAGGCAGC GGCAGAAG1T GTUTGCGAAG TGGAG3G3CT COOQQCQGAC ATOGGAGCAG 1680 

CATTAGTTCA AACCOCGGQC QGTCACGTAA GGATAATACC TCAAGCAAAT GACCGTATGA 1740 

TCGGACAGTA TATCG1TCTC TCGCCAAACT CTCTQCTBAA GMTGCCAAA CTOGCACCAG 1800 

OGCACOOGCT ASCAGATCAG GTTAAGATCA TAACACACTC CGGAAGATCA GGAAGGTACG 1860 

CGGTGGAACC ATAOGACGCT AAAGTACTGA TGCCAGCAGG AGGTQCGGTA CCATG3CCAG 1920 

MTTCCTAGC ACTGAGTCAG AGGGCCACGT TAGTCTACAA GGAAAGAGAG TTTCTGAACC 1980 

GCAAACTATA CCACATTGCC ATGCATQGCC GOGCCAAGAA TACAGAAGAG GAGCAGTACA 2040 

AGGTTACAAA GGCAGAGCTT GCAGAAACAG AGTACGTCTT TGAOGTOGAC AAGAAGOGTT 2100 

GGGTTAAGAA GGAAGAAQOC TCAGGTCTOG TOCTCT03QG AGAACIGACC AACCCTOCCT 2160 

ATCATGAXT AGCIUTQGAG GGACTGAAGA CCCGAOCTGC GGTCCGGTAC AAGGTCGAAA 2220 

CMTAGGAGT GATAQGCACA CCGQGGTCGG GCAAGTCAQC TATTATCAAG TCAAC1TSTCA 2280 

GG32AOGAGA TCT7GTTACC AQGQGAAAGA AAGAAAATTG TGGGGAAATT GAGGCOGACG 2340 
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(i) SEQUENCE CHARCTERISTICS: 

(A) LEUGTH: 8000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

ATTCVCQQCG TAGT/tACAC TATTGAATCA AACA9CCGAC CAATTQC/CT ACCATCACAA 60 

TGGAGAA9CC AGIAGTAAAC CTOGGTAG MXHXJ&G TC0CTTTGTC GTQCAACTGC 120 

AAAAAAQCTT CCCQCAATTT GflQGTAGTAG CCTCCAW GACCAT13CTA 180 

ATGCCA6AGC ATTTTOQCAT CTQQCCAGTA AACTAATCGA QCTGS^GTT CCTACCACAG 240 

CGAC6ATCTT QGACATAGGC A90GCACCQG CTCGTASAAT GTTTTCCGAG CACCACTATC 300 

ATTGTGTCTG CCCCATGCGT AG7CCAGAAG ACCOQGACCG CA7GAT6AAA TAOQCCAGTA 360 

AACTGSCGGA AAAAQOGTGC AAGATTACM ACA^GAACTT GCATGAGW ATTAAQGATC 420 

TC0G3ACCGT ACTTGATACG CCQGATQCT6 AAACAOCATC QCTCTGCTTT CACAAC6ATG 480 

TTACCTOCAA CATGCGTOCC 6MTATTCCG TCATQCAGGA OGTGTATATC AACGCTCCCG 540 

GWTATCTA TCATCAQQCT ATGWGQCG TQCQGACCCT CTACTOGATT GQCTTCGACA 600 

CCflCCCAGTT CATGTTCTOG GCTATGGCAG GTTOGTACCC TQCGTACAAC AOCAACTQQG 660 

CCGACGflGAA AGTCCTTGAA GOQCGTAACA TCQGACTTTG CAQCACAAAG CTCOGWG 720 

GTAQGACAQG AAAATTGTCG ATAATGAGGA /CftAQGAGTT OVGCCCG9G T0GO3QGTTT 780 

ATTTCTOCGT AGCATCGflCA CTTTATCCAG AACAT/GAQC C/OTGCflG AGCTGQCATC 840 

TTDCATOQGT GTTCCCTTG AATQGWGC AGT03TACAC TTGCCGCTGT GATACAGTG3 900 

TGflGTTOOGA AG3CTACGTA GTGAA3AAAA TCACCATC/G TCCCQQGATC AG33G/ S GAAA 960 
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GGCTGCTTA CTTATGCGSG ATTCACOGQS MTTAGTGOG. TAGGCTTAGG GCCGTCTTGC 6780 

TTCCAAACAT TCACAGGCTT TITGCATGT 033CGSAGGA TTTTGAIGCA ATCATAGCAG 6840 

AACC7TCM GCAAGGOGAC C0Q3T/O3G AGAjCGGATAT CQCATCATTC GACAAAAGCC 6900 

AAGACGAOX TATQQCGTTA AC03GTC1GA TGATCTTGGA GGACCTQGGT CTGSATCAC 6960 

CACT/ODGA CTTGATCGAG 7GCGCCTTTG GAGAAATATC ATOCACCCAT CTACCTACQG 7020 

GTAO0GTTT TAAA7TCGGG G0GA.7GATGA AATCCGGAAT GTTCCTCACA. CIIIIIG IGA 7080 

ACACAGTTTT GAATGTGGTT ATOXCAGCA GflGTACTAGA AGAGCGGCTT AAAA0GTCCA 7140 

GATCTOCAGC GTTCATTOQC GACGACAACA TCATACATQG AGTAGTATCT GACAAAGAAA 7200 

TGGCTGAGAG GTOOGCCACC TQGCTCAACA TGGAQGTTM GATCATOGAC GCAGTCATCG 7260 

GTGAGAGACC ACCTTACTTC TOOGQOQGAT TTATCTTGCA AGATTCGSTT ACTTCCACAG 7320 

OGTGCGXGT GXGGAICCC CTGAAAAGQC TGTTTAAGTT GQGTAAACGG CTCCCAGCCG 7380 

AGGADGAGCA A&CGAAGAC AG4AGACGGG CTCTGCTAGA TGAAACAAAG GOGTQGTTTA 7440 

GAGTAGGTAT AACflGGCACT TTAGCAGTGG CCGTGACGAC CGGGTATGAG GTAGACAATA 7500 

TTACACCTGT CCTACTQGCA TTGAGAACTT TTGCCCAG^G CAW&GCA TTCCAAjCCA 7560 

TCAGAGG9GA MTAAAGCAT CTCTAQGGTG GTCCTAAATA GTCAGCATAG TTCATTTCAT 7620 

□GCTMTA CTACAACADC ACCACCATGA ATAG0G6ATT CTTTAAGATG CTCGGG0GCC 7680 

GCCCCTTCCC QGCCCCCACT GCCATGTGGA GGCG30QGAG AAGGAQGCAG G0QGCCCCGA 7740 

TGCOQQOCG CAAGGGGCTG GCTTCTCAAA TCCAGCACT GACCACAG0C GTCAGTCCCC 7800 

TAGTCATTCG ACAG3CAACT AGACCTCAAC CCCCATGTCC AGQCCCGCCA 00G0GCCAGA 7860 

AGAAGCAGGC GCCCAAGCAA CCACCGAAGC CGAAGAAACC AAAAACGCAG GAG¥GAAGA 7920 

AGAAGGAACC TGCAAAACCC AAACCCQGAA AGAGACAGOG CATGQCACTT AAGTTGSAGG 7980 

CCGACAGAIT GTTDGAOGTC 8000 
(2) INFORMATION FOR SEQ ID NO: 102: 
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CA20GGCAAG AAAAGASCCC ACTCCACGQ3 CAA9CAATAG CTCIGflGTCC CTGCACCTCT 5460 

CTTTTOG1QG GGTATCCATG TCCCTGGGOT CAATnTOGA OGGAGflGAOG GCCOGCCAGG 5520 

CAGGGGTACA AGGCCTGQCA ACAGGGCCCA OQGATCTOX TA7G7UTTTC GGATOGTTTT 5580 

CCGAOGGAGA GATTCATGAG C1GAGC0GCA GAGTAACIGA GTGGGAACCC GTCCTG7T7G 5640 

GATCATTTGA ACOQQGOGAA GTGAACTCAA TTATATGGTC CCG4TCAGCC GTATCTTTTC 5700 

CTCTAOGCAA GC^GAGAGGT AGAGGCAGG4 GCAGGAGGAC TGAATAC7GA CTAAOCfflGG 5760 

TAGGTGGGTA CATATTTTCG ACQGACACAG QCCCTQQQCA CTJQCAAAAG AAGTGOGTTC 5820 

TCCAGAACCA GCTTACAGAA CCGACCTTCG AGCACAATGT COQGAAAGA ATTCATQCCC 5880 

OQGTCCTCGA CAOGTGGAAA GAGGAACAAC TCAAACTCAG GTACCAGATG ATCCCCACCG 5940 

AAGCCAACAA AAGTAGGTAC CAGTCTGGTA AAGTAGAAAA TCAGAAAGCC ATAACCACTG 6000 

AGGGACTACT GTCASGACTA CGACTGTATA ACTOSCCAC A5ATCA9CCA GAATGCTATA 6060 

AGA7CACCTA TCCGAAACCA TTGTACTGCA GTAGOGTACC GQGGAACTAC TCGGATCCAC 6120 

AGTTCGCIGT AGCTGTCTGT AACAACTATC TGCATGAGAA CTATCCGACA GTAGCATCTT 6180 

ATCAGATTAC TGAGGAGTAC GATGCTTACT 7GGATATBGT AGAOGQGACA GTCGCCTGCC 6240 

TGGATACTGC AACCTTCTGC CCCGOAAQC TTAGAAGTTA CCOGAAAAAA CATGAGTATA 6300 

GAGCCCG3AA TATCOQCAGT GCGGTTCCAT CAGCGAIGCA 6AACAGQCTA CAAAATGTGC 6360 

TCATTGCCGC AACTAAAAGA AA1TQCAACG TCACGCAGAT GGGIGAAC7G CCAACAC7GG 6420 

ACTCAGCGAC ATTCAATGTC GAATOOTTC GAAAATATGC A1GTAATCAC GAGTATTCGG 6480 

AGGAGTTCQC TOGGAAGCGA ATTAGGATTA CCACIGAGTT TGTCACOQCA TATGTAGCTA 6540 

GACTGAAAGG 0CCTAA9GCC GCGGQOAT TTGCAAAGAC GTATAATTTG GTCCCATTGC 6600 

AAGAAGTCOC TATBGATAGA TTOGTCATGG ACATCAAAAG AGACGTGAAA GTTACACCAG 6660 

GCAQGAAACA CACAGAAGAA AGACCGAAAG TAGAAGTGAT ACAAGCCGCA GAACCCCTGG 6720 
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GTACACGGCA ATTCACCCCG CACCATCTGA ATTBDGW. TTOGTOjGTC TATGAGGGTA 4080 

CAAGAGA7GG AGTTGGAGCC GOQCCGTCAT ACCGCACCAA AAGGGAGAAT ATTGCTGACT 4140 

GTCAAGAGGA AGCAGTTGTC AAOGCAGCCA ATCCGCTOQG TAGACCAGQC GAAGSAGTCT 4200 

GCOGTGCCAT CTATAAACGT TBGCGGACCA GTTTTACCGA TTCAGCCACG GTOtt 4260 

CGGCAAGAAT GACTG757GC CTAGGAAAGA AAGIGATCCA CQOGGTOQQC CCTGATTTCC 4320 

GGAAGCACCC AGAAGCAGTA GCCTTCAAAT TGCTACAAAA CQCCTACCAT.GCAGiraGCAG 4380 

ACTTAGTAM TGAACATAAC ATCAAGTCTG TOQCCATTCC ACTGCTATCT ACAGGCATTT 4440 

AGGCAGCCGS AAAAGACGGC CTTGAAGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

6AACTGA0GC GGAOGTAACC ATCTATTGCC TGGATAAGAA GTQGAAQGAA AGAATCGAOG 4560 

CGGCACTCCA ACTTAAGGAG TC1GTAACAG AGCIGAAGSA TGAAGATATG GAGATOGAOG 4620 

ATGAGTTAGT ATGGATCCAT CCAGACA3TT GCTIGAAGGG AAGA4AG9GA TTCAGTACTA 4680 

CAAAAGGAAA ATTGTATTGG TACTTGGAAG GCACCAAATT CCATCAASCA GCAAAAGACA 4740 

TQGCGGAGAT AAAGGTCCTG TTCCCTMTC ACCAG3AAAG TAA1GAACAA CTGTGTGCCT 4800 

ACATATTGGG TGAGCCATC GAAGCAATCC GCGAAAAGTG CCCQGTCGAC CATAACCOGT 4860 

CGTTTAGCCC GCCCAAAAGG TTGCCGTQCC TTTCCATGTA TOCCAT6ACG CCAGAAAGQG 4920 

TCCACAGACT TAGAAGCAAT AAGGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CTAAGCACAA MTTAAGAAT GTTCAGAAQG TTCAGTCCAC GAAAGTAGTC CTGTTTMTC 5040 

CGCACACTCC GSCATTOGTT CCCGCCCGTA AGTACATAGA AGTGCCAGAA CAGCCTACCG 5100 

CTCCTCC7GC ACAGQCCGAG GAQQCCCCCG AAGTTGTAGC GACACCGTCA CCATCTACAG 5160 

CTGATAACAC CTCQCTTGAT GTCACAGACA TCTCACTGGA TATGGATCAC ASTAGCGAAG 5220 

GCTCACTTTT TTGQAGCTTT AGGGGATGGG ACAACTCTAT TACTAGTAIG GACAGTTCGT 5280 

CGTCAQGACC TAGTTCACTA GAGATAGTAG ACOGAAGGCA GGTQGTGGTG GCTGAOGTTC 5340 

AIGCCGTCCA AGAGCCTGCC CCTATTCCAC CGCCAAGGCT AAAGAAGAIG GCOOGCCTGG 5400 
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7GTGGG0GTT TCGCAGTAAG AA4GTACATT CGTTTTATCG CCCACCTGGA ACSCAGACCT 1380 
GGGTAAAAGT CCCAQCTTCT 7TTAS0GCTT TTCCCAIGTC GTCCGTATGG ACGACCTCTT 1440 
TGCCCATCTC GCTGAGGCAG AAAT7GAAAC TOGCAT7GCA ACCAAAGAAG GAGGAAAAAC 1500 
7GCTGCAGGT CTCGGAQGAA TTAGTCATI3G AGGCCAAGSC TGCTTTTGAG GA7GCTCAQG 1560 
A3GAAGCCAG AGOQGAGAAG CTCCGAGAAG CACTTCCACC ATTAGTGGCA GACAAAGQCA 1620 
TCGfiQQCAGC CGCAGAAGTT CTOGGGAAG TOGAGGGGCT CCAGGCQGAC ATCGGAGCAG 1680 
CATTAGTTGA AACCCOGOGC GGTCAGGTAA QGATMTACC TCAAGCAAAT GCCGTATGA 1740 

TGQGACAGTA TATCGT7GTC TCGCCAAACT CTCTACTGAA GAATGCCAAA CTCGCAGCAG 1800 

0GCACCCGCT AGCAGA7CAG GTTAAGATCA TAACACACTC GGGAAGATCA Q3AAGGTAGG I860 

GGGTOGAACC ATAOGACGCT AAAGTACIGA 7GCCAGCAGG AQGTQCOGTA CCATQQCCAG 1920 

MTTCCTAGC AOGAGTCAG AGOGCCAOGT TAGTGTACM OGAAAGAGflG CTTC7GAACC 1980 

GCAAACTATA CCACATTGOC ATOCATOGCC CGTCAASAA TACA3AASAS 6AGCAGTACA 2040 

AGGTTACAAA GGCAGAGCTT GCAGAAACAG AGTA0G1G1T TGAOGTOGAC AAGAAGGGTT 2100 

GCGTTAAGAA GGAAGAAGCC TCAQGTOQG TCCTCTGG3G AGAACTCACC AACCCTCCCT 2160 

ATCATCAGCT AGCTCTGGAG GGACTCAAGA CCCGAOCTCC QGTCCCGTAC AAGGTGGAAA 2220 

CAATAGGAGT GATAGQCACA CCGQQGTCGG GCAAGTCAGC TATTATCAAG TCAACTCTCA 2280 

GGGCAOGAGA TCTTGTTACC AGCQGAAAGA AAGAAAATTG TG30GAAATT 6AGGCCGA0G 2340 

TCCTAAGACT GAQGQGTATG CAGATTAQGT CGAAGACAGT AGATTCG5TT ATQCTCAAOG 2400 

GA7GCCACAA AGCCGTAGAA GTGCTGTAOG T7GACGAAGC GTTCQGGTCC CAQGCAQGAG 2460 

CACTACTTGC CTTCATTCCT ATCGTCAQQC CGOGCAAGAA QGTAGTACTA TGQGGAGACC 2520 

CCATCCAATC OQGATTCTTC AACATCAIGC AACTAAAGGT ACATTTCAAT CAQCCTGAAA 2580 

AAGACATATG CACCAAGACA TTCTACAAGT ATATCTOCCG GOGTTGCACA CAQCCAGTTA 2640 

CAGCTATTGT ATCGACACTG CATTACGATC GAAAGATGAA AACCACGAAC CCGTCCAAGA 2700 
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• ATIGAOGGCG 1PGIMWC TATTCAATCA 
TGGAGAASCC AGTAGTAAAC GTAGACGTAG 
AAAAAAGCTT CCCGCMTIT GA9GTAGTAG 
A7GCCASAGC ATTTTOGCAT CTCGCCAGTA 
OGACGATCTT QGACATAGGC AGOGCACCQG 
ATTGTGTCTG GOCCATGGGT AGTCCASAAG 
AACTGGCGGA AAAAGCGTGC AAGATTACAA 
TCCGGACOGT ACTTGATACG CCGGATGCTG 
TTACCTQCAA CATGOGTOCC GAATATTCOG 
GAACTATCTA TCATCAGGCT ATGAAAGGOG 
COCCCAGTT CATGTTCTCG GCTATGGCAG 
CCGACGOGAA AGTCCTTGAA GCQCGTAACA 
GTAGGACASG AAMTTGTCG ATAATCA9GA 
ATTTCTCCGT AGGATCGACA CTTTATCCAG 
TTCCATCGGT GTTCCACTTG MTCGAAAGC 
TGAGT7GCGA AQGCTAGGTA GTGAAGAAAA 
CCGTGQGATA OGOGGTTACA CACAATASGG 
CAGTAAAAQG AGAAQQGGTA TCGTTCCCTG 
ATCAGATCAC TCGTATMTG GCCAOGGATA 
TTG3XTCAA OCAGCGAATT GTCATTAAGG 
AAAATTACCT TCTGOCGATC ATAGCACAAG 
ATGATCmSA TAAGGAGAAA AIGCTGGGTA 



AACAGCCGAC 


CAATCGCACT 


ACCATCACAA 


60 


ACCCCCAGAG 


TOiGIIIGIC 


GTCCAACTGA 


120 


CACAGCA5GT 


CACTCCAAAT 


GACCATCCTA 


180 


AACTAATOGA 


GCTGGAGGTT 


CCTAOCACAG 


240 


CTCGTASAAT 


GIIIICOGAG 


CACCAGTATC 


300 


ACC06GACCG 


CA1GATGAAA 


TAGGCCAGTA 


360 


ACAASAACTT 


GCAIGAGAAG 


ATTAAGGATC 


420 


AAACACCATC 


GCTCTGCTTT 


CACAAOGATG 


480 


TCATCCAGGA 


OGTGTATATC 


AACGCTCCOG 


540 


TQjQGACCCT 


GTACTGGATT 


GQCTTGGACA 


600 


GTTCGTACOC 


TOOGTACAAC 


ACCAACTCGG 


660 


TCGGAUIIG 


CAGCACAAAG 


CTCAGTCAAS 


720 


AGAAGGAGTT 


GAAGCOCGGG 


TnfinGGGTTT 


780 


AACACAGAGC 


CASCTTGCAG 


AGCTOGCATC 


840 


AGTCGTACAC 


TTOCCGCTGT 


6ATACAGTGG 


900 


TCACCATCAG 


TOCCGGSATC 


ACGGGAGAAA 


960 


AGGGCTTCTT 


QCTATGCAAA 


GTTACTGACA 


1020 


TCTGCAGGTA 


CATCCCGGCC 


ACCATATGCG 


1080 


lAlC/LUbA 


/Y*AT/"Y*APAA 

CEAIGLAtAA 


AAA/ ' 1 1/ ' IfY^ 


1140 


GTAGGACTAA 


CAGGAACACC 


AACACCATGC 


1200 


GGTTCAGCAA 


ATGGGCTAAG 


GAQGQCAAGG 


1260 


CTAGAGAAOG 


CAAGCTTAGG 


TATQGCTGCT 


1320 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:99: 
OGCGCCGAAT TGTGAGCGCT CACAATTCGG 30 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Pro Lys Lys Lys Lys Arg Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8000 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
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(8) TYPE: nucleic acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:96: 
QCCCTTTCTC TTCTTTTTTG GCTGCCOGCT TTCOGTCQG GAAAC 45 
(2) INFORMATION FOR SEQ ID N0:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPCLCGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CCAAAAAAGA AGAGAAAGGG CGGTQGTGCT TTGTCTCCT 39 
(2) INFORMATION FOR SEQ ID N0:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:98: 
ATATAGAGCT CTTAAACTGA TGATTTGATT TCAAATGC 
(2) INFORMATION FOR SEQ ID NO: 99: 



38 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:93: 
ATATAGAflCT CA3XGTTCV\ AAAGATTAGC GACGG 
(2) INFORMATION FOR SEQ ID N0:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:94: 
TATATATTAA TTAAATA6AA TGACACCTAC TCAGACMTG CGATQC 
(2) INFORMATION FOR SEQ ID N0:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:95: 
ATATACTCGA GTAGCAATQG TCAAACCAGT AACGTTATAC 
(2) INFORMATION FOR SEQ ID N0:96: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
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(xi) SEQUENCE DESCRIPTION: SEO ID NO:90: 
TASTGAACCG TATTGACGGC GTASTACACA CTATT 35 
(2) INFORMATION FOR SEQ ID N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:91: 
CGTTGflGCAT AACCGAATCT AC 22 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi ) SEQUENCE DESCRIPTION: SEQ ID N0:92: 
ATATAGfiQCT CTTAATTAAT CTITGTGAAG GAACCTTACT TC 42 
(2) INFORMATION FOR SEQ ID N0:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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TATATATATG GGGCCGCTCA ATTATGTTTC TGoTTGGT 
(2) INFORMATION FOR SEQ ID N0:88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TATATA3ATC TGGCQCGCCT TTACCACTCC TATC/CTGAT AG 
(2) INFORMATION FOR SEQ ID NO: 89: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:89: 
TACGCCGTCA ATACQGTTCA CTAAAC6AQC TCTGC 35 
(2) FORMATION FOR SEQ ID N0:90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID N0:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:85: 
C/CGGTCCTG AQGTGC 16 
(2) INFORMATION FOR SEQ ID N0:86: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:86: 

TATATATATC T0GA6ACCGC CAAGATGTTC CCCTDCCA3C CA 42 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(0 STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:87: 



WO 97/38087 



PCT/US97/06010 



220 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CCACAAGCTT GGACCACCGT AGAG 
(2) INFORMATION FOR SEQ ID NO:83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:83: 
COGCGTQGCG GATOCCCTG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 

* (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CCACQGATCC QGAAGGSACA GAAG 



24 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:79: 
CCACAflGCTT COGGflGTTAC CCGAGlGftCC 30 
(2) INFORMATION FOR SEQ ID N0:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
CCACCTT/YG OjTCGGCTTT TTCTTC 26 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:81: 
CCACCTTAAG AGAAGAGAAA GAATG 25 
(2) INFORMATION FOR SEQ ID N0:82: 
(i) SEQUENCE CHARACTERISTICS: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CCAjGGATCC CQQCGITCCG TCC 23 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CCACAAGCTT GTGCACTGGG ATCTG 25 
(2) INFORMATION FOR SEQ ID N0:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPCLCGY: linear 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CCAQQGATCC GTQCACATGA AGTCC 25 
(2) INFORMATION FOR SEQ ID N0:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:73: 
ACACATTAAT TAAGTATTGG CCCCMTQQG GTCT 
(2) IiraMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CCAC6AATTC GGTCCTAAAT ASATGC 
(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCACAAGCTT CCGGGCGAQG CCOCC 
(2) INFORMATION FOR SEQ ID N0:76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

• (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i ) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
ATATAGGCGG CCGCTCATCT TCGTGTGCTA GTCAQCATC 39 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
ACACATTAAT TAACGATGCC GCCGGAAGCG AGAA 34 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:68: 
ATATACTCGA GACCACCACC AT6AATAGAG GATTC 35 
(2) INFORMATION FOR SEQ ID N0:69: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69: 
WATGCGGC CGCTATTACC ACTCTTCTGT CCCTTCCGQG GT 42 
(2) INFORMATION FOR SEQ ID N0:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

* (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 70: 
ATATACTCGA GAGCAATGTC CGCAQCACCA CTQGTCACQG CA 



42 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:65: 
CATGGGAAAC GATOCTCATC CTTACAATCG TGGTTTTCAA AGS 43 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:66: 
GATGAGGATC GTTTCGCATG ATTGA 25 
(2) INFORMATION FOR SEQ ID NO: 67: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
TATATATGCA TTCAGAAGAA CTOGTCAAGA AGGCGA 
(2) INFORMATION FOR SEQ ID NO: 68: 



36 
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(xi) SEQUENCE DESCRIPTION: SEO ID N0:62: 
GATATAGTGG TCAGTATCGC CG 22 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:63: 
TATATGGATC (IGGAGATCC TTAATCTTCT CATG 34 
(2) INFORMATION FOR SEQ ID N0:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:64: 
TATATATOCA TCCCCCCCCC CCCCAADG 28 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: "SEQID 
TATATGAATT OGTTTGGACA AACCACAACT AGAATG 36 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:60: 
TATATAGATC TAGTCTTATG CAATACTCTT GTAGT 35 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTICN: SEQ ID N0:61: 
GGGATACTCA CCACTATATC TOGACGGTAT CGAQGTAGGG CACT 44 
(2) INFORMATION FOR SEQ ID N0:62: 

(i) SEQUENCE QWRACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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ATATATCCQG AGTCCGGCGG CTTQffiTO^ GA93CTA 37 
(2) IlfCRMATICN FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:57: 
ATATAGGATC CTCAGAASAA CTCGTCAAGA AGGCGA 36 
(2) INFCRMATICN FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:58: 
TATATATGAG CTCTTACAAA TAAAQCAATA GCATCACAAA TTTC 44 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE (KARACTERIST1CS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) ST1WDEDNESS: single 

(D) TOPOLOGY: linear 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:54: 
CTGTTTAAAC AGATCTTATC TCGAGTATGC QQCCGCTATG AATTCGTTTA AACGA 55 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE OiARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:55: 
TCGTTTAAAC GAATTCATAG OGGCCGCATA CTCGAGATAA GATCTGTTTA AACAG 55 
(2) INFORMATION FOR SEQ ID N0:56: 

( i ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
* (B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID ND.-51:. 
TATATAGGAT CCTAGSTTAG GTTGGMTCT AAMTACACA. AAC 43 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:52: 
TC6ATCCTA5 GA 12 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:53: 
QCTCTTAATT AAGAGC 16 
(2) INFORMATION FOR SEQ ID NO: 54: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48: 
CCTATGCQ3C CGCTAGCTG GAC^GCCAAT GACG 34 
(2) INFORMATION FCR SEQ ID NO: 49: 

(i) SEQUENCE CHAWOERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 
CCTATTGGCC AGCCAAGACA TCATCCQQQC AS 32 
(2) INFORMATION FOR SEQ ID N0:50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:50: 

TATATATCCG 6AAAQCTCTA AGGTAAATAT AAAATTTTT 39 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 43 base pairs 
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(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGTAACAAGA TCT03TGCCG TG 22 
(2) IfFCRMTION FCR SEQ ID NO: 46: 

(T) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi ) SEQUENCE DESCRIPTION: SEQ ID N0:46: 
CCTATGCGQC CGCGTQSAAC CTTTGTGGCT CCTC 34 
(2) INFORMATION FOR SEQ ID N0:47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 
CCTATTG3CC AGCAGACCAA TTTATQOCTA C 31 
(2) INFORMATION FCR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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CCGTCAATAC QGTTOVCTM AGGAQCTCTG CTTATATAGA CC 42 
(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:43: 
GCTCGTTTAG TCAAOjGTAT TGACGGCGTA GTACACAC 38 
(2) INFORMATION FOR SEQ ID N0:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 
CTQ3CAACCG GTAA3TACGA TAC 23 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMHWESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 
TATATGAGCT CCTCCCTT/C CCATCCGCT QGACGTGCGT CCTCCTTCQG ATGCCCAGGT 60 
CGGACCQCG 69 
(2) INFORMATION FOR SEQ ID N0:41: 

(i ) SEQUENCE OWCTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPT1CN: SEQ ID N0:41: 

TATATAIAGA TCTTTGACAT TGATTA1TGA CTAG 34 

(2) INFORMATION FOR SEQ ID N0:42: 

(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:42: 
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(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TATATATGAG CTCTMTAAA ATGOGGAAAT TGCATOGCAT TGTC 44 
(2) IffCRmTION FOR SEQ ID N0:38: 

(i) SEOJENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: 
TATATGAATT CATAGAATGA CACCTACTCA GACAATGOGA TGC 43 
(2) INFORMATION FOR SEQ ID N0:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:39: 
AA AA A AAAAA GGGTCGGCAT GGCATCTCCA CCTOCTCGCG GTCCGACCTG GQCATC 
(2) INFORMATION FOR SEQ ID N0:40: 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:34: 
/ffiATCCTAG T 11 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:35: 
CTCGATCCTA GGATC6AGQC 20 
(2) INFORMATION FOR SEQ ID N0:36: 

(i) SEQUENCE OWRACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:36: 
GAQCTAGGAT CCTAGCTC03 20 
(2) INFORMATION FOR SEQ ID N0:37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
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; TATATATATA TG033COGCC GCTAG3CCCC MTGATCGGA G 41 

] 

(2) INFORMATION FCR SEQ ID N0:32: 

t 

! (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO:32: 
CTATAGAGCT OGTTTAAACT llllllllll llllllllll [ 1 1 1 1 1 1 1 1 1 I II II II I K, 60 
AAATG 65 
(2) INFORMATION FOR SEQ ID N0:33: 

( i ) SEQUENCE 01ARACTERISTICS : 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 
TCGATCCTA3 GA 12 
(2) INFORMATION FOR SEQ ID N0:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID N0:29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(0 S7RANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) sequence description: seq id n0:29: 
gaagtcccag a/ogcctac cg 22 
(2) informaticn for seq id n0:30: 
(i) sequence characteristics: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANTJEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 
TATATCTCGA QGGTQGTGTT GTAGTATTAG TOG 34 
(2) INFOPMTICN FOR SEQ ID N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 
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(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 
ATATACTCGA GACCATGATT GAACAAGATG GATTG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TATATAQCGG CCGCTCAGAA GAACTCGTCA AGAAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 

' (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: 
CTGTAGATGG TGACGGTGTC G 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 
TATATTCTAG AMI I III 1 1 1111111111 TTTTTTT7TT T7TTTTGAAA TG 52 
(2) INFORWJICN FOR SEQ ID N0:24: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATATATCTAG AGCCATQQGC CACACACGGA GGCAG 35 
(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATATAGGATC CCTGTTATAC AGQGCGTACA CTTTC 35 
(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
TGQGTCTTCA ACTCACCQGA C 
(2) INFORMftJICN FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CMTTCGACG TACQCCTCAC TC 
(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi)'SEQUENCE DESCRIPTION: SEQ ID N0:22: 
GAGTGAGGCG TACGTCGAAT TG 
(2) INFORMATION FOR SEQ ID N0:23: 

(i) SEQUENCE MARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIFTICN: SEQ ID NO: 17: 
CCTTTTC#£ G^TCCQCCA C 21 
(2) IlfTOKTIGN FOR SEQ ID NO: 18: 

(1) SEQUENCE CHARACTERISTIC: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
6TGGCGGATC CCCTGAAAA6 G 21 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi ) SEQUENCE DESCRIPTICN: SEQ ID NO: 19: 
TQ3QCCGTGT GGTOGTCATG 20 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

. (0 STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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FURTHER INFORMATION C NTTNUED FROM PCT/ISA/ 210 



1. Claims 1-9. 18-25 and partially 17 : 

An isolated nucleic acid molecule comprising an altered alphavirus protein gene which, when 
operably incorporated into a recombinant alphavirus particle , 

. increases the time required to reach 50% inhibition of host-cell directed 
macromolecuiar synthesis following expression in mammalian cells, or 

. has a reduced level of vector-specific RNA synthesis and the same or greater level of 
proteins encoded by RNA transcribed from the viral junction promoter, 
as compared to a wild-type alphavirus particule; 

an alphavirus vector construct. RNA replicon or eukaryotic layered initiation system including 
it: an alphavirus producer cell line comprising one of these vectors, a recombinant alphavirus 
particle comprising the replicon. a host cell infected by this particle or containing the replicon. 
pharmaceutical compositions comprising these replicon or particle and methods for utilizing 
these alphavirus-based vectors. 



2. Claims 10-14 : 

A Togavirus capsid particle which contains substantially no genomic or RNA vector replicon 
nuc!eic acids and pharmaceutical composition comprising thereof. 

3. Claims 15. 16 and partially 17 : 

An alphavirus structural. protein expression cassette, comprising a 5* promoter which initiates 
synthesis of RNA from DNA. a nucieic acid molecule which encodes one or more functional 
alphavirus structural proteins, a selectable marker operably linked to transcription of the 
expression cassette . and a 3* sequence which controls transcription termination; alphavirus 
packaging and producer ceil lines comprising cells containing this cassette. 



REMARK: As far as claim 21 is directed to a method of treatment of 
the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/composition. 
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1. x Claims Nos. 



because they relate to subject matter not required to be searched by this Authority, namely: 

See annex. 



2. Q Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be earned out, specifically: 



□ 



3. Claims Nos. 



because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first shoot) 



This International Searching Authority found multiple inventions in this international application, as follows : 



See annex 



1. 



□ As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims 



2. [^J As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invite payment 
of any additional fee. 



3- | ) As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
l — * covers only those claims tor which fees were paid, specifically claims Nos. : 



No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-9, 18-25 and partially 17. 



Remark on Protest 



[ | The additional search fees were accompanied by the applicants protest. 
| [ No protest accompanied the payment of additional search fees. 
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junction region promoter, as compared to a wild-type recombinant alphavirus particle. Also disclosed are RNA vector replicons, alphavirus 
vector constructs, and eukaryotic layered vector initiation systems which contain the above-identified nucleic acid molecules. 
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SNBS-luciferase Vector Packaging by 
i pBGSVCMVdlhyg Packaging Cells 




E2 transfection 
D infection 

1. BHK control 

2. pBGSVCMVdlhyg PCL 
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Cells Transfected 



Fig. 39 
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Fig. 31 A 



Fig. 31B 
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Amplification of Reporter Gene Expression 
by Transfection of PCLs with ELVS-lac Z 
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Fig. 28 
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Sequential Amplification and Recombinant Protein 
Expression Using SNBS™ Vectors and PCL 




Fig. 27 
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Ooto- 

replicon (vector) 
1. SINrep/lacZ 



2. 



3. Sinrep/capsid 



helperfoll tRNA 5') 
DHBBCrrvSin-gps 



DHBBCsin + 

DHBB(CdeletedRRVSin-gps) 
DHBB(CdeletedRRVSin-gps) 



inf units/ml 
5x10-8 

5-10x10-9 



pfu/ml 

1.5x10-6 
pinpoint 
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the latter (as well as gels) show that only the Sindbis virus capsid gets incorporated into 
extracellular particles when both are present in the tronsfected cell. 

Ill constructs 

A TotollOI - sites for the insertion ot the RRV capsid gene 

B RRV - sites used tor cloning of the RRV capsid gene 

C chimeras ]) complete RRV capsid gene 

2) RRV copsid gene nts 7835-7888 deleted. 



Fig. 26 
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